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Abstract 



Males and females are markedly different in their choice of college major. Two main 
reasons have been suggested for the gender gap; differences in innate abilities and 
differences in preferences. This paper addresses the question of how college majors are 
chosen, focusing on the underlying gender gap. Since observed choices may be consistent 
with many combinations of expectations and preferences, I use a unique data set of 
Northwestern University sophomores that contains the students’ subjective expectations 
about choice-specific outcomes. I estimate a choice model where selection of college 
major is made under uncertainty (about personal tastes, individual abilities, and 
realizations of outcomes associated with the choice of major). Enjoying coursework, 
finding fulfillment in potential jobs, and gaining the approval of parents are the most 
important determinants in the choice of college major. Males and females have similar 
preferences while in college, but their preferences diverge in terms of the workplace; 
Nonpecuniary outcomes at college are most important in the decisions of females, while 
pecuniary outcomes realized at the workplace explain a substantial part of the choice for 
males. I decompose the gender gap into differences in beliefs and preferences. Gender 
differences in beliefs about academic ability explain a small and insignificant part of the 
gap, a finding that allows me to rule out low self-confidence as a possible explanation for 
females’ underrepresentation in the sciences. Conversely, most of the gender gap is the 
result of differences in beliefs about enjoying coursework and differences in preferences. 

Key words; college majors, uncertainty, subjective expectations, preferences, gender 
differences, culture 
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1 Introduction 



The difference in choice of college majors between males and females is quite dramatic. In 1999-2000, among 
recipients of bachelor’s degrees in the United States, 13% of women majored in education compared to 4% 
of men, and only 2% of women majored in engineering compared to 12% of men (2001 Baccalaureate and 
Beyond Longitudinal Study). Figure 1 highlights the differences in gender composition of undergraduate 
majors of 1999-2000 bachelor’s degree recipients (see also Turner and Bowen, 1999; Dey and Hill, 2007). 

These markedly different choices in college major between males and females have significant economic and 
social impacts. Figure 2 shows that large earnings premiums exist across majors. For example, in 2000-2001, 
a year after graduation in the United States, the average education major employed full-time earned only 
60% as much as one who majored in engineering (for a discussion of earnings differences across majors, also 
see Garman and Loury, 1995; Arcidiacono, 2004). Paglin and Rufolo (1990) and Brown and Corcoran (1997) 
find that differences in major account for a substantial part of the gender gap in the earnings of individuals 
with several years of college education. Moreover, Xie and Shauman (2003) show that, controlling for major, 
the gap between men and women in their likelihood of pursuing graduate degrees and careers in science and 
engineering is smaller. The gender differences in choice of major have recently been at the center of hot 
debate on the reasons behind women’s under-representation in science and engineering (Barres, 2006). 

There are at least two plausible explanations for these differences. First, innately disparate abilities be- 
tween males and females may predispose each group to choose different fields (Kimura, 1999). However, 
studies of mathematically gifted individuals reveal differences in choices across gender, even for very talented 
individuals. For example, the Study of Mathematically Precocious Youth shows that mathematically talented 
women preferred careers in law, medicine, and biology over careers in physical sciences and engineering (Lu- 
binski and Benbow, 1992). Moreover, the gender gap in mathematics achievement and aptitude is small and 
declining (Xie and Shauman, 2003; Goldin et ah, 2006), and gender differences in mathematical achievement 
cannot explain the higher relative likelihood of majoring in sciences and engineering for males (Turner and 
Bowen, 1999; Xie and Shauman, 2003). These studies suggest gender differences in preferences as a second 
possible explanation for the gender gap in the choice of major. However, no systematic attempt has been 
made to study these preferences. 

In this paper, I estimate a choice model of college major in order to understand how undergraduates 
choose college majors and to explain the underlying gender differences. The choice of major is treated as 
a decision made under uncertainty- uncertainty about personal tastes, individual abilities, and realizations 
of outcomes related to choice of major. Such outcomes may include the associated economic returns and 
lifestyle as well as the successful completion of major. My choice model is closest in spirit to the theoretical 
model outlined in Altonji (1993), which treats education as a sequential choice made under uncertainty. In his 
dynamic model, the decisions about attending college, which field to major in, and dropping out are based on 
uncertain economic returns, personal tastes, and abilities. I, however, do not model the choice of college. The 
particular institutional setup in the Weinberg Gollege of Arts & Sciences (WGAS) at Northwestern University 
allows me to estimate a choice model of college major where the decision can be treated as dynamic. However, 
since individuals are assumed to maximize current expected utility, and they are surveyed at a time when 
they are actively thinking about choosing a major, a static choice model is estimated in this paper. 
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The standard economic literature on decisions made under uncertainty generally assumes that individuals, 
after comparing the expected outcomes from various choices, choose the option that maximizes their expected 
utility. Given the choice data, the goal is to infer the parameters of the utility function. However, the 
expectations of the individual about the choice-specific outcomes are also unknown. The approach prevalent 
in the literature overlooks the fact that subjective expectations may be different from objective probabilities, 
assumes that formation of expectations is homogeneous, makes nonverifiable assumptions on expectations, 
and uses choice data to infer decision rules conditional on maintained assumptions on expectations. However, 
this can be problematic since observed choices might be consistent with several combinations of expectations 
and preferences, and the list of underlying assumptions may not be valid (see Manski, 1993a, for a discussion 
of this inference problem in the context of how youth infer returns to schooling). To illustrate this, let us 
assume that only two majors exist. Let us further assume that it is easier to get a college degree in the first 
major, but that it offers lower-paying jobs than the second major. An individual choosing the first major is 
consistent with two underlying states of the world: (1) she cares only about getting a college degree, or (2) 
she values only the job prospects but wrongly believes that the first major will get her a high-paying job. If 
one observes only the choice, then clearly one cannot discriminate between the two possibilities. The solution 
to this identification problem is to use additional data on expectations to allow the researcher to separate the 
two possibilities, and that is precisely what I do. 

I have designed and conducted a survey to elicit subjective expectations from 161 Northwestern University 
sophomores regarding choice of major. The survey collects data on demographics and background information, 
data relevant for the estimation of the choice model, and open-ended responses intended to explore how 
individuals form expectations. 

In contrast to most studies on schooling choices that ignore uncertainty, I estimate a random utility model 
of college major choice allowing for heterogeneity in beliefs.^ My approach also differs from the existing 
literature by accounting for the non-pecuniary aspects of the choice. Fiorito and Dauffenbach (1982) and 
Easterlin (1995) highlight the importance of non-price determinants in the choice of majors. However, no 
study has jointly modeled the pecuniary and non-pecuniary determinants of the choice. My approach allows 
me to quantify the contributions of both pecuniary and non-pecuniary outcomes to the choice. Moreover, the 
model is rich enough to explain gender differences in choices. 

Average responses to questions eliciting subjective expectations match up with existing objective statistics 
for several questions, indicating that respondents answer meaningfully and seriously. However, respondents 
exhibit significant heterogeneity in their responses (both between and within genders), which underscores the 
importance of expectations data to conduct inference in settings with uncertainty. The current study does 
not analyze the subjective data in detail since that would abstract from the main goal of the paper. Interested 
readers are instead referred to Zafar (2008), which analyzes the subjective data. 

I estimate separate models for single-major choice and for double-major choice. The most important 
outcomes in the choice of single major are enjoying coursework, enjoying work at potential jobs, and gaining 
the approval of parents. Non-pecuniary outcomes explain about 45% of the choice behavior for males and 

^Literature on college majors has largely ignored the uncertainty associated with the various outcomes of the choice. Two notable 
empirical exceptions are Bamberger (1986) and Arcidiacono (2004). However, the former only takes into account the uncertainty about 
completing one’s field of study. The latter estimates a dynamic model of college and major choice under highly stylized assumptions on 
expectations formation. 
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more than three- fourths of the choice for females. Males and females have similar preferences at college, but 
differ in their preferences regarding the workplace: Males care more about the pecuniary outcomes in the 
workplace and females about the non-pecuniary outcomes. The results for the model of double-major choice 
are similar to those for single major. Graduating in four years, gaining approval of parents, and enjoying 
coursework are the most important determinants of the choice. Additionally, I find evidence of individuals 
strategically choosing pairs of majors that allow them to specialize along certain dimensions. Females prefer 
pairs of majors that entail different chances of completion and getting a job upon graduation. Males, on the 
other hand, prefer pairs of majors that differ in their chances of completion, in the approval of parents, and 
in how much they would enjoy the coursework. 

On the methodology side, this paper adds to the recent literature on subjective expectations (see Manski, 
2004, for an overview of this literature). In the last decade or so, economists have increasingly undertaken the 
task of collecting and describing subjective data. Studies have shown that subjective data tend to be good 
predictors of behavior. For example, Euwals et al. (1998) show that data on desired working hours is helpful in 
explaining female labor supply. Similarly, Hurd et al. (2004) show that subjective survival probabilities affect 
retirement behavior and timing of social security benefits claims in meaningful ways. Recently, expectations 
data have been employed to estimate decision models: Wolpin (1999), van der Klaauw (2000), and van der 
Klaauw and Wolpin (2008) show that incorporating subjective expectations data in choice models improve 
the precision of the parameter estimates. Delavande (2004) collects subjective data to estimate a model of 
birth control choice for women. The choice model used in this paper is motivated by her framework. My 
paper contributes to this literature by providing an extensive description of students’ expectations about 
major-specific outcomes and by using subjective expectations data to estimate a choice model. 

Second, this paper contributes to the recent literature on culture and economic outcomes (see Guiso et 
al., 2006; Fernandez et al., 2004). In order to establish a causal link from culture to economic outcomes, I 
focus on the dimension of culture that is inherited by an individual from previous generations, rather than 
being voluntarily selected. I use information on the country of origin of the individual’s parents as a cultural 
proxy. I find that cultural proxies bias preferences in favor of certain outcomes. Individuals with foreign-born 
parents value the pecuniary aspects of the choice more. In particular, males with foreign-born parents are 
the only sub-group in my sample for whom pecuniary outcomes explain more than 50% of the choice. 

Finally, this paper is related to the literature that focuses on the underlying reasons for the gender gap 
in science and engineering. An important question is whether gender differences in choices are driven by 
differences in preferences or in beliefs. In the recent debate on the under-representation of women in science 
and engineering, some authors have claimed that the gap may be driven by the fact that women are less 
self-confident about their academic abilities than men. Valian (1998) argues that social prejudice against 
women causes them to lose self-confidence. Indeed, Solnick (1995) finds that women are more likely to shift 
to other majors from traditionally female majors if they attend a women’s college. To check the validity of 
these hypotheses, I decompose the gender gap in major choice into differences in beliefs and differences in 
preferences. First, I find that gender differences in beliefs about ability constitute a small and insignificant 
part of the gap. This implies that explanations based entirely on the assumption that women have lower self- 
confidence relative to men (Long, 1986; Niederle and Vesterlund, 2007) can be rejected in my data. Second, 
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the majority of the gender gap in majors that I consider can be explained by gender differences in beliefs 
about tastes for studying different fields, and differences in preferences. For example, 60% of the gender gap in 
engineering is due to differences in preferences, while 30% is due to differences in how much females and males 
believe they will enjoy studying engineering. Gender differences in beliefs about future earnings in engineering 
are insignificant and explain less than 1% of the gap. I simulate an environment in which the female subjective 
belief distribution about ability and future earnings is replaced with that of males; in the case of engineering, 
this reduces the gap by only about 14%. These results suggest that simply raising expectations for women in 
science, as claimed by Valian (1998), may not be enough, and that wage discrimination and under-confidence 
with regard to academic ability may not be the main reasons why women are less likely to major in science 
and engineering. 

The paper is organized as follows. Section 2 outlines the choice model and the identification strategy. 
Section 3 describes the institutional setup of Weinberg College of Arts & Sciences, outlines the data collection 
methodology, and briefly describes the subjective data. Section 4 outlines the econometric framework used 
for estimation. Section 5 presents the estimation results for the model of single-major choice. Section 6 
summarizes the results for the model of double-major choice (details of the model and estimation are available 
in the online Appendix^). Section 7 undertakes a decomposition technique to understand the sources of gender 
differences in choice of major. Finally, Section 8 concludes. 

2 Choice Model 

At time t, individual i is confronted with the decision to choose a college major from her choice set Ci. 
Individuals are forward-looking, and their choice depends not only on the current state of the world but also 
on what they expect will happen in the future. Individual i derives utility c, ^it) from choosing major 

k. Utility is a function of a vector of outcomes a that are realized in college, a vector of outcomes c that 
are realized after graduating from college, and individual characteristics Xu. Examples of outcomes in a 
include graduating within four years, enjoying the coursework, and gaining approval of parents. Examples 
of outcomes in c include future income, number of hours spent at the job, and ability to reconcile family 
and work. Both vectors, a and c, are uncertain at time t; individual i possesses subjective beliefs Pjfcj(a, c) 
about the outcomes associated with choice of major k for all k £ Ci.^ If an individual chooses major m, 
then standard revealed preference argument (assuming that indifference between alternatives occurs with zero 
probability) implies that: 

m^argmax / [/jfct(a, c, Ait)dPjfci(a, c) (1) 

keCi J 

The goal is to infer the preference parameters from observed choices. However, the expectations of the 
individual about the choice-specific outcomes are also unknown. The most one can do is infer the decision rule 
conditional on the assumptions imposed on expectations. This would not be an issue if there were reasons to 
think that prevailing expectations assumptions are correct. However, not only has the information- processing 
rule varied considerably among studies of schooling behavior, but most assume that individuals form their 

^The online Appendix is available at www.newyorkfed.org/research/economists/zafar/pl_appendix.pdf. 

^Though each major has an objective probability for (a, c), there’s no reason to believe that subjective beliefs will be the same as the 
objective probabilities. 
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expectations in the same way.^ First, there is little reason to think that individuals form their expectations 
in the same way. Second, different combinations of preferences and expectations may lead to the same choice. 
Manski (2002) shows that different combinations of preferences and expectations (about others’ behavior) 
leads to the same actions in the ultimatum game. To cope with the problem of joint inference on preferences 
and expectations, I elicit subjective probabilities directly from individuals. An additional advantage of this 
approach is that it allows me to account for the non-pecuniary determinants of the choice (data that do not 
exist otherwise). 

The exact utility specification is outlined in section 4, which presents the econometric framework. I first 
describe the data collection methodology. 

3 Data 

I collected data on 161 Northwestern University sophomores. This section describes the institutional details 
at Northwestern, the data collection method, and the nature of the subjective data. 

3.1 Institutional Details 

Northwestern University requires students to declare their major by the end of their sophomore year. Since 
sophomores are actively thinking about their choice of major, this study is restricted to Northwestern Univer- 
sity sophomores. The study is further restricted to schools at Northwestern University that accord students 
flexibility in choosing a major. For example, a student in the School of Engineering has to declare her major 
at the time of admission and can change her major only by a special request to the school. This student 
would not be eligible for the study. I further assume that the choice set for an individual is exogenous. This 
eliminates students in smaller schools at Northwestern since I will have to make strong assumptions about 
their choice set. Therefore, I restrict the study to the Weinberg College of Arts & Sciences (WCAS) at 
Northwestern. All sophomores with at least one major in the WCAS were eligible for the study.^ 

3.1.1 Choice Set 

WCAS offers a total of 41 majors. To estimate the choice model, one needs to elicit the subjective probabilities 
of the outcomes for each major in one’s choice set. In order to limit the size of the choice set, I pool similar 
majors together. Table 1 shows the majors divided into various categories. Categories a through g span 
the majors offered in WCAS. Categories h through I span undergraduate majors offered by other schools at 
Northwestern University. There is a trade-off between the number of categories and the length of the survey. 
This categorization is fairly fine and also seems reasonable. 

For a student pursuing a single major in WCAS, it is assumed that her choice set includes all the categories 



^Freeman (1971) assumed that income expectation formation of college students is myopic, that is, the youth believe that they will obtain 
the mean income realized by the members of a specified earlier cohort who made that choice. Arcidiacono (2004), in his dynamic model of 
college and major choice, makes strong assumptions about various outcomes; for example, he assumes that youth condition their expectations 
of future earnings on their ability, GPA, average ability of other students enrolled in that college, and some demographic variables. Similarly, 
he assumes that all individuals have the same expectations about the probability of working, conditional on sex and major. The list of studies 
that explicitly (or implicitly) make assumptions about expectations formation is long, and there is no evidence that prevailing expectations 
assumptions are correct. 

^A student could have a second major in any other school. She could take part in the study as long as she was pursuing a major in 

WCAS. 
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that span WCAS majors {a-g), and category k, the majors offered in the School of Engineering.® Therefore, 
any student with a single major is assumed to have 8 categories in her choice set. 

For an individual with a double major, the choice set is conditional on whether both her majors are in 
WCAS and the School of Engineering, or not. Conditional on the student’s majors being in WCAS and the 
School of Engineering, the choice set is the same as that of a single major respondent except that the goal 
now is to select pairs of majors rather than a single one. Conditional on one of the majors being in a school 
other than WCAS or the School of Engineering, the choice set includes all major categories that span WCAS, 
category k, and the category which includes the student’s non-WCAS major. ^ 

3.2 Data Collection 

A sample of eligible sophomores and their e-mail addresses was provided by the Northwestern Office of 
the Registrar. Students were recruited by e-mail, and flyers were posted on campus in schools other than 
WCAS.® The e-mails and flyers explicitly asked for sophomores with an intended major in WCAS. Prospective 
participants were told that the survey was about the choice of college majors and that they would get $10 
for completing the 45-minute electronic survey. It was emphasized that students need not have declared 
their majors to participate in the study. The survey was conducted from November 2006 to February 2007. 
Respondents were required to come to the Kellogg Experimental Laboratory to take the electronic survey. 

A total of 161 WCAS sophomores were surveyed, of whom 92 were females. Table 2 shows how the charac- 
teristics of the sample compare with those of the sophomore class. The sample looks similar to the population 
in most aspects. However, a few differences stand out: (1) students of Asian ethnicity are overrepresented 
in my sample; (2) 61% of the respondents had declared their majors at the time of the survey, whereas the 
corresponding number for the sophomore population was only 18%. However, this statistic for the population 
was obtained at the beginning of the sophomore year. Since students may declare their majors at any time 
during the academic year, it is very likely that this statistic was greater than 18% for the population at the 
time of the survey; and (3) it seems that survey-takers, especially male students, have higher CPAs than their 
population counterparts. 

Table 3 presents the distribution of WCAS majors in the sample. For comparison, the major distribution 
for the graduating class of 2006 is also presented.® There are a few notable features. The proportion of males 
who (intend to) major in Social Sciences II is twice the corresponding proportion of women both in my sample 
and in the graduating class of 2006. This pattern is reversed in the case of Social Sciences I and Literature 
and Fine Arts. The proportion of females who (intend to) major in Literature and Fine Arts is more than 
three times the corresponding proportion of males. 

The 45- minute survey consisted of three parts. The first part collected demographic and background 
information (including parents’ and siblings’ occupations and college majors, source of college funding, etc.). 
The second part collected data relevant for the estimation of the choice model, and is discussed in more 
detail in the next subsection. The third part collected responses to open-ended questions intended to explore 

®This was done to elicit subjective beliefs about the outcomes associated with majoring in Engineering. 

'"For example, the choice set for a student with a major in WCAS and the School of Education would be categories a-g, i, and k. 

*E-mails advertising the survey were also sent out by WCAS undergraduate advisors, Economics professors teaching large core classes, 
and deans of some schools (other than WCAS). 

®This is the most recent year for which data are available. There is no reason to believe that the distribution of majors would stay the 
same over time. 
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how respondents form expectations about various major-specific outcomes and to identify the sources of 
information they used. At the end of the survey, respondents were asked if they were willing to participate 
in a follow-up survey in a year’s time. If the respondents agreed to the follow-up, they were asked for their 
names and contact information. An astounding 97 % (156 out of 161 ) respondents agreed to the follow-up. 

3.3 Subjective Data 

The subjective beliefs, Pjfcj(a, c) \/k G Ci, are elicited directly from the respondent. The vector a includes the 
outcomes: 

a\ successfully completing (graduating) a field of study in four years 

02 graduating with a GPA of at least 3.5 in the field of study 

03 enjoying the coursework 

04 hours per week spent on the coursework 

05 parents approve of the major 



while the vector c consists of: 

Cl get an acceptable job immediately upon graduation 
C2 enjoy working at the jobs available after graduation 
C3 able to reconcile work and family at the available jobs 
C4 hours per week spent working at the available jobs 
C5 social status of the available jobs 
cg income at the available jobs 

An individual’s choice of major might be motivated by several pecuniary and non-pecuniary concerns. 
An individual motivated primarily by future earnings prospects may choose a major that is associated with 
large income streams (cg), allows a high probability of getting a job upon graduation (ci), and increases 
the possibility of getting jobs with high social status (cg). An individual concerned about her ability may 
choose a major that presents a greater probability of completion (oi) and allows her to graduate with a higher 
GPA (02). On the other hand, an individual may choose a major with low-salary job prospects that allows 
a flexible lifestyle (03, C4) or provides opportunities to do things she enjoys (02). Similarly, an individual’s 
choice may be influenced by the kinds of courses she finds interesting (03) or by how demanding the major 
is (04). Finally, the choice may be influenced by parents and family ( og). Another interpretation of these 
outcomes is as follows: a\ and 02 are outcomes that capture ability in college; 03 can be interpreted as taste 
in college; and C2 and C3 may be interpreted as tastes in the workplace. 

Note that {or}r={i,2,3,5} {cg}g={i 2,3} are binary, while outcomes 04 and {cg}g={4 g^gj are continuous. 
For all k G Q, the survey elicited the probability of the occurrence of the binary outcomes, i.e., Pikt{ar = 1 ) 
for r = { 1 , 2 , 3 , 5 } and Pikt{cq = 1 ) for q = { 1 , 2 , 3 }. Expected value was elicited for the continuous outcomes, 
i.e., Eikt{aA) and Eikt{cq) for q = { 4 , 6 }. 



Questions eliciting the subjective probabilities of major-specific outcomes are based on the use of percent- 
ages. As is standard in studies that collect subjective data, a short introduction was read and handed to the 
respondents at the start of the survey: 
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"In some of the survey questions, you will be asked about the PERCENT CHANCE of some- 
thing happening. The pereent chanee must be a number between zero and 100. Numbers like 2 or 
5% indicate “almost no chance," 19% or so may mean “not much chance," a 41 or 55% chance 
may be a “pretty even chance," 82%o or so indicates a “very good chance," and a 95 or 98% mean 
“almost certain." The percent chance can also be thought of as the NUMBER OE CHANCES OUT 
OE 100. 

We will start with a couple of practice questions. " 

This introduction is similar to the one in the Survey of Economic Expectations (SEE), described in Do- 
minitz and Manski (1997). However, as in Delavande (2004), I do not round off the percentages. Eor example, 
I use 19% instead of 20% to encourage respondents to use the full range from zero to 100. Respondents had 
to answer two practice questions before starting the survey to make sure they understood how to answer 
questions based on the use of percentages. 

The questions dealing with subjective expectations were worded as follows: 

If you were majoring in [X], what do you think is the percent chance that you will graduate 
with a CPA of at least 3.5 (on a scale of 4)? 

and: 

Look ahead to when you will be 30 YEARS OLD. If you majored in [X], what do you think is 
the percent chance that you will be able to reconcile work and your social life/ family at the kinds 
of jobs that will be available to you? 

The question eliciting the expected number of hours per week spent on coursework was: 

If you were majoring in [X], how many hours per week do you think you will need to spend on 
the coursework? 

Social status of the available jobs was elicited as follows: 

Look ahead to when you will be 30 years old. Rank the following fields of study according to 
your perception of the social status of the jobs that would be available to you and that you would 
accept if you graduated from that field of study. 

Eor the expected income, the question was as follows:^^ 

Look ahead to when you will be 30 years old. Think about the kinds of jobs that will be available 
to you and that you will accept if you graduate in [X]. What is the average amount of money that 
you think you will earn per year by the time you are 30 YEARS OLD? 

In addition, I elicited the subjective belief of being active in the full-time labor force at the age of 30 and 
40, and E{Yq), the expected income at the age of 30 if one were to drop out of college. 

Questions eliciting beliefs about major-specific outcomes can be viewed in section 9.1 of the Appendix. 
The full questionnaire is available from the author on request. 

'^^This question elicits an ordinal ranking of the social status of the jobs. However, I treat these ordinal responses as cardinal in the choice 
model analysis. In hindsight, this question should have been asked in terms of subjective expectations of getting a high-status job. 

^^The wording of this question is very similar to that of Dominitz and Manski (1996), who elicit expectations of the returns to schooling 
from high school and college students. 




3.4 The Data 



Since the use of subjective data in economics is fairly recent, there is a genuine interest in analyzing the 
precision and accuracy of such data. However, given that the main goal of this paper is to estimate the 
preferences for college major choice by taking the subjective data as given, I don’t undertake this task here 
since it is not possible to summarize the data in a condensed form. Moreover, to address the extent to 
which cognitive biases (such as respondents making little mental effort in answering questions, cognitive 
dissonance, and social desirability) affect beliefs, one would need to observe how beliefs evolve over time. 
Interested readers are referred to Zafar (2008), which analyzes this set of issues by using a panel of beliefs 
elicited from the same sample. The companion paper shows that respondents provide meaningful answers to 
questions eliciting subjective expectations. In cases where responses could be compared to objective realities 
and statistics, on average, survey responses match up well. For example, individuals are aware of the earnings 
differences across majors. Moreover, Zafar (2008) does not find evidence of cognitive biases systematically 
affecting the way in which individuals report their beliefs. 

It should be pointed out that there is substantial heterogeneity in responses both between and within 
gender. In order to highlight the heterogeneity in beliefs across respondents, I discuss the response to a 
representative question. Table 4 presents the gender-specific subjective belief distribution of graduating with 
a GPA of at least 3.5 in Engineering and in Literature and Fine Arts. The table shows that respondents are 
willing to use the entire scale from zero to 100. It does seem that respondents tend to round off their responses 
to the nearest 5, especially for answers not at the extremes. There has been some concern that respondents 
might answer 50% when they want to respond to the interviewer, but are unable to make any reasonable 
probability assessment of the relevant question. However, the 50% response is not the most frequent one 
in the majority of the cases. There doesn’t seem to be any evidence of anchoring, since numbers that were 
presented in the introductory text do not occur more often than others. The substantial heterogeneity in 
beliefs questions the accuracy of restrictions imposed on expectations in the literature. 

4 Econometric Model 

This section outlines the econometric framework. 

Recall that utility, Uikt{si,c, Xu), is a function of a 5 x 1 vector of outcomes a realized in college, a 6 x 1 
vector of outcomes c realized after graduating from college, and individual characteristics Xu. The individual 
maximizes her current subjective expected utility^^; she chooses major m at time t if: 

m = argmax / Uikt{a,c, Xit)dPikt{si,c) (2) 

keCi J 

Moreover, as explained in section 3.3, the outcomes {or}r={i, 2 , 3 , 5 } {cq}g={i^ 2 , 3 } a-re binary, while outcomes 

04 , and {cq}g ={4 5 6} are continuous. I change the notation slightly and define b to be a 7 x 1 vector of all 
binary outcomes, i.e., b = {oi, 02, 03, 05, ci, C2, C3}, and d to be a 4 x 1 vector of all continuous outcomes, i.e., 
d = {04, C4, C5, cq }. The utility can now be written as a function of outcomes b, d, and characteristics Xu. I 
^^See Bruine de Bruin et al. (2000). This is what they call "epistemic uncertainty," or the "50-50 chance." 

^^Under the assumption that individuals maximize current expected utility, I don’t need to take into account that individuals may find it 
optimal to experiment with diff’erent majors. However, experimentation could be important in this context to learn about one’s ability and 
match quality (see Adalamud, 2006, and Stinebrickner and Stinebrickner, 2008). It is beyond the scope of this paper. 
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assume that utility is additively separable in the outcomes: 

7 4 

d, Xit) = 'y ^ Ur{bri Xit') + ^ ^ Hiqtdq ^ikt 
r=l g=l 

where Ur{br,Xit) is the utility associated with the binary outcome br for an individual with characteristics 
Xit, 'Jiqt is a constant for the continuous outcome dq, and Sikt is a random term. The utility is the same for 
all individuals with identical observable characteristics Xu up to the random term. Equation (2) can now be 
written as: 

7 4 

m = arg max (E / + ^ 

J 

r=l q=l 

An individual i with subjective beliefs {Pikt{br) , Pikt{dq)} for r G {1, ..,7}, q G {1, ..,4} and V/c G Ci chooses 
major m at time t with probability: 



'liqt J" dqdPikt(^dq^ T ) 



P^{'^\Xit, {Pikt{br), Pikt[dq)}r£{l,..,7},q£{l,..,4:}; k£Ci) ~ 



Pr 



f X^r=l S ^r{br , Xit)dPimt{br^ T ^iqt S dqdPimt{dq) T £imt \ 



\ ^ X>r=l J" ^r{br , Xit)dPikt{br) T Tigt J" dqdPikt{dq) T £ikt ) 



(3) 



VA: G Ci, m ^ k 

For the binary outcomes in b, Pimt{br) is simply Pimt(br = 1) for r G 7}; Pimt{br = 1) is elicited 
directly from the respondents for Vr G {1, .., 7} and \/k G Ci. For the continuous outcomes in d, instead of the 
probability distribution, the expected value of the outcome Eikt(dq) = f dqdPikt(dq) is elicited Vg G {1, ..,4}.^“^ 
Next, I explain how I compute the expected income. Since one must successfully complete the major to 
gain the associated earnings, Eiktid^), i's expected earnings associated with choice k at time t are: 

£,„(*) = |wdGU^,)\p.,.EMI) + (1 - for k,p 

where w is an indicator variable of the individual’s labor force status, Cit{w) is the subjective belief at time t 
about one’s labor force status at the age of 30, and pikt is individual i’s subjective probability at time t about 
successfully graduating in major k. The belief of being active in the labor force at the age of 30 is simply 
Cit{w = 1). J wdCit{w) = Git{w = 1), denoted as gu, is elicited directly from the respondents.^^ Conditional 
on being active in the labor force, with probability Pikt: the individual’s expected earnings are Eikt{I), the 
expected income associated with major k at the age of 30; with probability 1 — pikt, her expected earnings 
are Eit{Io), the expected income at the age of 30 if one were to drop out of school at time Equation (3) 



consequence of the linear utility specification is that the individual is risk-neutral, i.e., f Uit{y,h,d,Xit)dPikf{Y,h,d) = 
UitiJ Y-,h,d, XitdPikt{Y-,'^id)). Hence, I need to elicit only the expected value for the continuous outcomes. 

^'^Note that the underlying assumption is that expectation of being active in the labor force, gu., is independent of one’s field of study. This 
is a rather restrictive assumption since one’s decision to participate in the labor force may be influenced by the job opportunities available, 
which would be related to one’s field of study. Relaxing this assumption would have required me to ask this subjective expectation for each 
field of study in one’s choice set, and that would not have been feasible. 

^®In an earlier version of the model, I allow the individual to change fields of study once before dropping out of school. However, the 
results don’t seem to change much. 
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can now be written as: 



Pr(77l|^it, {Pikt{br) , (<^^5 ) }rG { 1, . . ,7} ,gG {1 , . . ,4}; kGCi) — 



Pr 



^ Pjmt(br — l)Ur{br — 1, Xu) + [1 — Pimt{br — l)]Ur(&r — 0, Xu)} + 'I iqt^imt{dq) + Eimt 



y > ^^,.— ]^{Pikt{br — l)Ur(br — 1, Xu) + [1 - Piktibr = l)]Ur{br = 0,Xu)} + 1 iqtEikt[dq) + Eikt j 



( 4 ) 



Vk £ Ci, m ^ k 

Moreover, PimtiW = l)ur{hr = I, Xu) + [1 - PimtiK = l)]ur(br = 0,Xit) is equivalent to Pimt{br = 
l)Xur{Xit) + Ur{hr = OjXjf), where Aur(Xu) = Ur{bj. = l,Xu) — Uribr = OjXjj), i.e., it is the differ- 
ence in utility between outcome hr happening and not happening for an individual with characteristics Xu- 
The expected utility that individual i derives from choosing major m at time t is: 



Uimtih,d,Xu,{Pimt{br = l)}J=i, {-E'imi(dg)}g=i) = 

( 5 ) 

~ X^r=l Pimt{br = ^)AUr{Xu) + Ylr=l '^r{hr = 0, Xu) + liqt^imt{dq) + ^imt 

Equation (4) can now be written as: 



P'^{jn\Xu,{Pikt{br)) Eikt{dq)^r£{l,..,7},q£{l,..,i}\ k£Ci) — 



Pr 



^ Xyr=l Pimt{br — ^)AUr{Xu) + '7 iqtdd'imt{dq) -f £imt '' 

\ 'Yhr=l^ikt{br = ^)AUr(yXu) + ^2q=l '1 iqt^^iktidq) P £ikt j 



( 6 ) 



Vk G Ci, m ^ k 

{Aur{Xu)]l=i, and {'yiqt)q=i are the parameters to be estimated, gu, {Pikt{br = l)}r=D {^ikt{dq)]\=i, 
and Eikt{I) VA: G Ci are elicited directly from the respondent. In order to ensure strict preferences between 
choices, {sikt} are assumed to have a continuous distribution. The exact parametric restrictions on the random 
terms required for identifying the model parameters are discussed in the next section. 



5 Single Major Choice Model 

This section deals with estimating the preferences for choice of single majors. I drop the time subscript in 
the analysis that follows. 

5.1 Estimation with Homogenous Preferences 

The model described in section 4 assumes that the utility function for the binary outcomes Ur{br, Xi) and the 
coefficients on continuous outcomes ({7iq}g=i) depend on individual characteristics. I initially assume that 
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the utility function does not depend on individual characteristics. Under this assumption, (6) becomes: 



Pr{m\Pik{br), Eik{dq)}r^{l,..J},q^{l,..,A}; k&Ci) 



= Pr 



/ Eim{br ~ 1) AUc + 1 qEim(dq) + £imt \ 



\ > Hl=l PikiK = l)AUc + Ylt=l lqEik{dq) + £ikt j 



'ik G Ci, m ^ k 

If I assume that the random terms {eikt} are independent for every individual i and choice k, and that they 
have a Type I extreme value distribution, then {sikt ~ £imt} has a standard logistic distribution. Then the 
probability that individual i chooses major m is: 



Pr(7u|{Pjfc(l>j.), 7} 4}; k^Ci) ("I) 

_ exp(X;Ll Pim.{br = l)AUr + X]g=l lqEim{dq)) 

'ZkGCi exp(Er=l P^k{br = + Eg=i lqEik{dq)) 

Under these parametric assumptions, the parameters {Attr}J=i, { 7 <j}g=iare identified. The elicited 
subjective probabilities described in section 3.2 are used in estimation. Column (1) of Table 5 presents the 
maximum likelihood estimates using stated choice data.^^’^® 

The relative magnitudes of {Attr}J =4 show the importance of the binary outcomes in the choice. The 
difference in utility levels is positive and largest for enjoying coursework. The second most important outcome 
in the choice is graduating within four years; it has a positive coefficient that is about half of the coefficient 
on enjoying coursework. The third most important factor is enjoying work at the available jobs, which has a 
positive coefficient of a magnitude similar to the coefficient on graduating within four years. The difference 
in utility levels is positive for parents’ approval and (surprisingly) negative for graduating with a GPA of at 
least 3.5. Both coefficients are significant and about one-fourth the coefficient on enjoying coursework. The 
difference in utility levels for reconciling family and work is about one-sixth in magnitude compared to that 
of enjoying coursework, but is surprisingly negative. The coefficient on the social status of the jobs is positive 
and significant. A unit increase in the social status of available jobs changes the utility by as much as a 5% 
increase in the probability of graduating in four years. The coefficient on hours per week spent at work is 
negative, but not significantly different from zero. Though the coefficient on income is negative, it is not 
significantly different from zero, suggesting that it is not important in the choice. 

Column (2) of Table 5 shows the maximum-likelihood estimates based on (7) with the addition of female 
interactions in order to get some measure of relative differences between males and females. For males, the 
difference in utility levels is largest for enjoying coursework, finding a job upon graduation, and the social 
status of the jobs (in decreasing order of importance). For females, the three outcomes that matter most are 

'^'"Some 44 of the 83 respondents with a single major had declared their major at the time of the survey. For the remaining 39, I use their 
stated intended choice for estimation. 

Moreover, a respondent with an adjunct major (see Table 1) has to have another major. For the purposes of estimation, I don’t 
differentiate between an adjunct major and a normal major. Such respondents are treated as pursuing a single major if both their majors 
are in the same category, and as pursuing double majors if they are in different categories. 
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graduating in four years, enjoying the coursework, and enjoying work at the available jobs. Though income 
stays insignificant, the coefficient on income interacted with the female dummy shows that the negative 
coefficient on income in column (1) is being driven by the preferences of females; income has a positive 
coefficient for males now and a negative one for females (though neither are significant). 

In addition to stating their choice, respondents were also asked to rank the elements in their choice set. The 
stated preference data provide more information that can be used for estimation of the model parameters. 
Under the assumptions of standard logit, the probability of any ranking of alternatives can be written as a 
product of logits. For example, consider the case where an individual’s choice set is {a,b,c,d}. Suppose 
she ranks the alternatives b, d, c, a from best to worst. Under the assumption that the are iid and 
Type I distributed, the probability of observing this preference ordering can be written as the product of 
the probability of choosing alternative b from {a, b, c, d}, the probability of choosing d from {a, c, d}, and the 
probability of choosing c from the remaining {a, c}. If Uij = j3xij+ Sij denotes the utility i gets from choosing 
j for j G {a, 6, c, d}, then the probability of observing b y d y cy a is simply:^^ 



Pr(6 y d y cy a) 



exp{Pxjb) exp{/3xid) exp{Pxic) 

Ej&{aAc4} Ej&{a,c} 



Column (3) in Table 5 presents the maximum-likelihood estimates using stated preference data. The difference 
in utility levels is still largest and positive for enjoying coursework. Graduating in four years, the second most 
important outcome using stated choice data, is now negative but not significant. Enjoying work at the jobs 
is the second most important outcome, with a positive coefficient. Approval of parents, now the third most 
important outcome, has a positive coefficient that is one-half that of enjoying coursework. The difference in 
utility levels for graduating with a GPA of at least 3.5 is now positive and significant. Status of the jobs 
continues to be important: A unit increase in the social status of the jobs changes the utility by as much 
as a 4% increase in the probability of enjoying coursework. The difference in utility levels for other binary 
outcomes is not significantly different from zero. The coefficient on income is now positive, but not significant. 

Golumn (4) allows female interaction dummies in order to gain further insight into gender differences in 
preferences. For both genders, the difference in utility levels is largest and positive for enjoying coursework. 
For males, graduating within four years is the second most important outcome, but surprisingly it has a 
negative sign. The third most important outcome for males is the difference in utility levels for graduating 
with a GPA of at least 3.5; it is positive and about half that of enjoying coursework. Status of the jobs 
remains important for males: A unit increase in the status of the jobs changes the utility by as much as a 10% 
increase in the probability of enjoying coursework. For females, two of the important outcomes are gaining 
approval of parents and enjoying work at the jobs. Both have a positive coefficient that is about two-thirds 
the magnitude of the coefficient on enjoying coursework. Graduating within four years and graduating with 
a GPA of at least 3.5 have coefficients that are positive and about one-third of the coefficient on enjoying 

'^®Kapteyn et al. (2007) use a similar approach; they use elicited (stated) preferences data to estimate preference parameters for retirement. 

^®One concern with using stated preference data is that an individual may not have complete preferences over all alternatives that are 
available to her. In the case that a complete ranking does not exist, it is possible that the lower end of her preferences is noise. To check the 
sensitivity of the results, the model was also estimated by using the ranking of the four most preferred choices only. The results (available 
from the author upon request) are comparable to those obtained from using the complete preference data. Therefore, I continue to use 
complete stated preference data in the analysis that follows. 

logit on ranked data is called exploded logit in the literature. This is because a ranking of J alternatives explodes into J — 1 pseudo- 
observations for estimation purposes. This expression results from the particular form of the extreme value distribution, first shown by Luce 
and Suppes (1965). 
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course work. 

In order to get a measure of the magnitude of the estimated parameters, the natural thing would be to do 
willingness-to-pay calculations, i.e. translate the differences in utility levels into the amount of earnings that 
an individual would be willing to forgo at the age of 30 in order to experience that outcome. However, since 
expected income at age 30 is not significant in any of the specifications considered, the standard errors on such 
calculations are huge, and the results are not very meaningful. Instead of presenting the willingness-to-pay 
calculations, I outline a different decomposition method to gain insight into the relative importance of the 
various outcomes in the choice. For illustration, suppose that Fv{choice = j) = F(Kj(3) and that X includes 
two variables, X\ and X 2 . Given the parameter estimates, (3i and /? 2 , the contribution of X\ to the choice is 
defined as: 



Mxi = II Pr(c/io*ce = j\ {/?i,/32} ~ Pr(c/iozce = j\ {/?i =0,d2} || (8) 

^yjchoice = j\ {Pi, ^ 2 }) _ Pr(c/ioice = j\ = O.dzl) 

2^ N ^ N 

i=l i=l J 

where the first term is the average probability of majoring in choice j predicted by the model, and the 
second term is the average predicted probability of majoring in j if outcome X\ were not considered. The 
difference in the two terms is a measure of the importance of X\ in the choice. Similarly, the contribution of 
X 2 is given as: 




Mx^ = 



\ 



E 

i=i 



^ Pr(c/iozce = j\ {Pi, P 2 }) ^ Vv(choice = j| {/3 ^, /?2 = 0}) 



Z=1 



N 



(9) 



The relative contribution of X\ to the choice is then = m^+Mx ' ^^^tiple parameters can be set to 
zero simultaneously to get a sense of their joint contribution to the choice. However, since the model is not 
linear, generally Mx^+X 2 / + ^X 2 - Table 6 presents the results of this decomposition strategy. Each 

cell shows the relative contribution (R) of the outcome to the choice. Panel B of Table 6 presents the results 
of this decomposition technique using the estimates obtained from stated preference data. Column (1) shows 
the decomposition results of the estimates of the pooled sample: Nearly three-fourths of the choice is driven by 
the non-pecuniary outcomes. If the decomposition is made finer, one can see that gaining parents’ approval 
and enjoying coursework jointly explain about 45% of the choice. Pecuniary outcomes associated with college 
(hours per week spent on coursework, graduating with a GPA of at least 3.5, and graduating in four years) 
and workplace (finding a job upon graduation, hours per week spent at work, income at the age of 30, and 
the social status of the jobs) each account for about 20% of the choice. 

The estimates of the pooled sample mask the differences between males and females. Columns (2) and 
(3) of Table 6 show the decomposition results using the estimates from the male sub-sample and the female 
sub-sample, respectively. Non-pecuniary outcomes explain about 45% of the choices for males, but more than 

^^For example, the amount that an individual would be willing to forgo in earnings at the age of 30 for a 2% change in the probability of 

^ . . 0.02 X Aui 

outcome ? is 

'' 74 

Outcomes classified as being non-pecuniary are gaining parents’ approval, enjoying coursework, reconciling work and family, and enjoying 
work at the jobs. The remaining outcomes are termed as being pecuniary. 
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80% of the choice for females. Gaining parents’ approval and enjoying coursework are the most important 
outcomes for females, explaining about 45% of their choice, while pecuniary outcomes associated with the 
workplace are of utmost importance to males, explaining 48% of their choice. Reconciling family and enjoying 
work at the available jobs are second in terms of importance to females, but of least importance to males. 
On the whole, non-pecuniary determinants are crucial in explaining the choices for both males and females. 
However, males and females differ in their preferences in the workplace: Males value pecuniary aspects of the 
workplace more, while females value non-pecuniary aspects of the workplace more. 

Table 7 presents the results of various thought experiments in an attempt to assess how changes in beliefs 
affect the choice of majors for males and females. The baseline case is presented first. For example, the 
model predicts that the average probability of majoring in Engineering for males is 11.7%, more than twice 
that for females. Experiments 1 through 3 show changes in predicted probabilities in response to changes 
in beliefs of outcomes that are well defined (for example, graduating with a GPA of at least 3.5). Predicted 
probabilities are not very responsive to changes in beliefs in these cases. Experiments 4 through 6 show results 
of thought experiments for outcomes that are not well defined. Eor example, experiment 5 shows that the 
average probability of majoring in Engineering increases by 20% for females, and by about 10% for males, in 
response to a 10% increase in beliefs of enjoying coursework in Engineering. The results in Table 7 indicate 
that outcomes like enjoying coursework and gaining approval of parents are crucial in one’s choice of major. 

5.1.1 Comparison with Stated Preferences 

It would be interesting to see how the estimated preference parameters compare with stated preferences. 
Survey respondents were asked to state their preferences for various determinants in their choice. More 
specifically, respondents were asked to assign an integer between zero and 100 to a list of reasons such that 
the numbers added up to a 100.^^ Table 8 shows the average weights assigned to the various reasons given 
by males and females. I interpret these numbers as the relative preference for the given reason in the choice 
of major. Enjoying work at the jobs and learning more about things that interest them were the two most 
important reasons for choosing a major for both males and females. However, females, on average, assign 
higher weights to these reasons (the gender difference is significant). Eor males, the third most important 
stated reason for choosing a major is getting a high-paying job. Gonversely, doing well in the coursework is 
the third most important reason for females. These stated preferences for various outcomes are consistent 
with the parameter estimates discussed above. Determinants like the fraction of people of the same gender 
taking classes in the major or working at the jobs don’t seem to be important. One surprising finding is 
that males, on average, are more likely than females to have been encouraged by a mentor or role model to 
choose a major (similarly, females assign lower weights to peer pressure, siblings making the same choice, and 
parents wanting them to make the choice; the gender difference is not significant for any of these reasons, 
though). The result in this study could be driven by the fact that the sample is restricted to students who 
have at least one major in the Gollege of Arts and Sciences (which contains majors mostly dominated by 
females) and therefore males who have a major in the Gollege are more likely to have been encouraged by a 
role model than are females. 

^^This question was asked in the follow-up survey (Zafar, 2008), which was taken by 117 of the 161 original survey takers. 

^®This finding is in contrast to the literature in social psychology, which finds that females are more influenceable (Eagly, 1978). 
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5.1.2 Understanding the Insignificance of Future Income in the Choice of Majors 



This section discusses some robustness checks in order to determine whether income is actually insignificant in 
the choice of major or if the result is driven by large standard errors. One concern could be that individuals 
are not aware of earnings differences across majors, and that factor could be driving the result. Table 9 
presents the average and median beliefs of the respondents. Since individuals majoring in a field may have 
better information about their chosen field and may have beliefs different from those of individuals not 
majoring in it, I split survey responses by whether the respondent majors in the category about which the 
question is asked. Since Northwestern University does not follow its alumni, I use the 2003 average annual 
salaries for 1993 college graduates from selective colleges in the Baccalaureate & Beyond Longitudinal Study 
(B&B: 1993/2003) for comparison purposes.^® These statistics are presented in columns (1) and (2) of Table 
9. The average and median beliefs of respondents majoring in the field are similar to those who do not 
major in that field. Survey respondents, both males and females, seem to be aware of income differences 
across majors. However, both report median and average salaries larger than those for the B&B sample. It 
could be that the survey respondents are self-enhancing their own salary expectations. However, there are 
at least three legitimate reasons why respondents’ earnings expectations may be different from the earnings 
statistics in the B&B sample. First, even though I have restricted the B&B sample to selective institutions. 
Northwestern graduates may work at jobs very different from those of graduates from comparable institutions. 
Second, respondents might think that future earnings distributions will differ from the current ones. Third, 
respondents may have private information (other than gender) about themselves that justifies having different 
expectations. Though the descriptive analysis of respondents’ expectations of income in different majors in 
Table 9 indicates that students are aware of the income differences across majors, the variation in their 
responses is much larger than in actual data (for males in particular) . This indicates that the insignificance of 
income might be driven by the noise in the reported expectations. I undertake the decomposition in equation 
(9) for 1,000 bootstrap samples for each of the sub-samples. The bootstrap confidence interval of for 
both males and females does not include zero: The higher end of the 90% bootstrap interval for expected 
income is 16% and 7.5% for males and females, respectively. This seems to suggest that 74 is insignificant 
because of a large standard error and not because it is a precise zero.^^ 

A second reason for the insignificance of income in the choice could in part be due to the risk-neutrality 
assumption embedded in the model specification. This assumption was made primarily so that it would suffice 
to elicit the expected value for the continuous outcomes. In the absence of this assumption, I would have 
had to elicit multiple points on the subjective income distribution for each major in one’s choice set (as in 
Dominitz and Manski, 1996), which would not have been feasible for the purposes of this study. Since several 
studies have concluded that women are more risk averse than men in their choices (see Eckel and Grossman, 
2002, and Croson and Gneezy, 2008, for a summary of the literature), results in the current study regarding 



Colleges with high selectivity and the same Carnegie Code classification as Northwestern were used for comparison. Assuming students 
graduate from college at the age of 22, this would be their salary at age 32. 

^^An additional robustness check that I did was to estimate the model using the ordinal ranking of income (instead of expected income). 
This allows me to control for the noise in the reported income expectations. The coefficient on (ranked) income is now significant for the 
males, but continues to be insignificant for females. Moreover, the confidence interval of is [3.8%, 29.2%] for males and [3.6%, 18.7%] 
for females. The overall contribution of income and social status, however, does not change since ranked income picks up a substantial part 
of the contribution of status toward the choice (ranked income and status are highly correlated). Therefore, none of the results change. 
However, this seems to suggest that income is at least significant for males. 
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gender differences in income preferences could be a consequence of the risk-neutrality assumption. 

5.2 Estimation with Heterogeneous Preferences 

The analysis undertaken in Zafar (2008) shows that beliefs for various outcomes are associated with demo- 
graphic characteristics and cultural proxies. However, it could be the case that preferences for the different 
outcomes also depend on individual characteristics. For example, if individuals have declining marginal utility 
of consumption, and preferences are separable in consumption and non-pecuniary outcomes, then the value 
of pecuniary outcomes will be higher for individuals from low-income households. Such heterogeneity, if not 
accounted for, may bias the estimates presented in section 5.1. Several empirical studies have documented 
the influence of family and society in the endogenous formation of preferences. For example, Fernandez et 
al. (2004) find that whether a male’s mother worked while he was growing up is correlated with whether his 
wife works, and they interpret this as preference transmission. Similarly, Guiso et al. (2006) present evidence 
of culture affecting individuals’ preferences. I now relax the assumption of section 5.1 that the utility for 
each binary outcome Ur{br) and the constants 7 ^ for the continuous outcomes do not depend on individual 
characteristics other than gender. Though I have relatively rich demographic information on the respondents, 
it is not possible to account for heterogeneity in all outcomes because of the small sample size. I therefore 
consider heterogeneity along the following dimensions: 

1 . An individual might care about her parents’ approval for several reasons. She might be more inclined 
to ensure that her parents approve of her choice if she relies on them for college support. Moreover, 
concern for parents’ approval might depend on the individual’s cultural and ethnic background. I allow 
for heterogeneity in the utility for approval of parents by incorporating the financial support an individual 
receives from her parents when in college and whether her parents are foreign-born or not. 

2. Children growing up in divorced or separated households make different choices than other individuals 
(Gruber, 2004). Here, I consider the effect of growing up in such a household on the individual’s 
preference for being able to reconcile work and family. 

3. An individual’s preference for the social status of jobs may vary by her cultural background. In certain 
cultures, immense importance is given to the status of jobs. This heterogeneity is accounted for by 
taking into account whether the individual’s parents are foreign-born. 

4. If non-pecuniary outcomes are a normal good, an individual from a low-income family will value the 
income profiles associated with the majors more than other individuals will. I account for this hetero- 
geneity by including information on parents’ annual income. I also allow for heterogeneity by taking 
into account whether an individual’s parents are foreign-born or not. 
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The enriched utility function for individual i is: 



U{Xi, {Pim{br) -I Pim{dq)}r£{l,.,7} ,q£{l,.,4:}) 

= Pim{br = l)Aur + Au 4 [parents’_supportj^ x {1-Foreign^) x Pim{bi = 1)] 

r={l,2,3,5,6,7} 



+Au 4 [parents support x Foreign^ x Pimibi = 1)] + Awj [divorcedi x Pijt{b 7 = 1)] 



+ J2l=ilqEim{dq) + 7 s [{l-Fordgn-) x Ei^ids)] + 73 [ForeigUi x £’*^(^ 3 )] + 74 -E'im(d 4 ) 



+ 7 f^^[-E'im(d 4 ) X {l-low_inc^) x (l-Forei^nj)] + 7^ ^[£^(^ 4 ) x {l-low_inc^) x Foreign^] 



+ ^4^ [Eimidi) X low_inc^x {I- Foreign ^)] + '^^^[Eim{d4) x low_inc^ x Foreignj\ + Sim 

V m = 1 , 8 

where low_ inc is a dummy variable that equals one if the individual’s parents earn less than $150,000 annually; 
parents support captures the financial support an individual receives from her parents,^® Foreign is a dummy 
that equals one if either of the individual’s parents is foreign-born, and divorced is a dummy that equals one 
if the individual’s parents are either separated or divorced. 

I continue to assume that the random terms {sik} are independent for every individual i and choice k. 
Table 10 presents the maximum-likelihood estimates of this model using stated preference data. Estimates 
from the pooled sample in column ( 1 ) show that difference in utility levels is still largest and positive for 
enjoying coursework and that the coefficient is almost unchanged from the specification with homogenous 
preferences. The coefficients of the outcomes for which heterogeneity is not considered stay almost the same 
as that in the earlier specification. With this enriched specification, the difference in utility levels for parents’ 
approval is 0.34 for individuals with US-born parents who do not receive college support from them, and 
2.04 for individuals who annually receive more than $25,000 in college support from their parents. This is 
consistent with the hypothesis that approval of parents matters more to individuals who depend on their 
parents for college funding. However, I don’t find support for this hypothesis for individuals with foreign- 
born parents. The difference in utility levels for reconciling work and family continues to be insignificant. 
Individuals with separated or divorced parents have a negative coefficient for reconciling work and family, but 
it is not significantly different from zero. Introducing heterogeneity for the status outcome gives an interesting 
result. Status of the available jobs, an important determinant in the choice in the earlier specifications, is not 
important to individuals with US-born parents. However, for individuals with foreign-born parents, a unit 
increase in the social status of the jobs changes the utility by as much as an 8 % increase in the probability 
of the most important outcome, which is enjoying coursework. This implies that the large positive coefficient 
on the social status of jobs in earlier specifications is being driven by the preferences of individuals with 

is increasing in the financial support an individual receives from her parents. Parents’ support = 1 if no education expenses are paid 
by one’s parents; equals 2 if they pay less than $5,000; equals 3 if they pay between $5,000 and $10,000; equals 4 if they pay between $10,000 
and $15,000; equals 5 if they pay between $15,000 and $25,000; and equals 6 if they pay more than $25,000. 
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foreign-born parents in the sample. The coefficient on income at age 30 is still not significantly different from 
zero. However, there is weak snpport for the hypothesis that individuals from low-income households value 
the future earnings profile more in their choice. 

Columns (2) and (3) of Table 10 present the results of the heterogeneous choice model for the male and 
female sub-samples, respectively. Coefficients of outcomes that are not interacted with any demographic 
variables are almost unchanged with respect to the corresponding specification (column 4 in Table 5). For 
males with US-born parents, difference in utility levels for approval of parents varies from 0.578 when receiving 
no support from parents to 3.47 when annually receiving more than $25,000 in support from them. The 
corresponding coefficient for females with US-born parents is only half the magnitude of that for males. The 
coefficient on parents’ approval for females with foreign-born parents is similar in magnitude to that of males 
with US-born parents. Surprisingly, the utility change in approval of parents for males with foreign-born 
parents is not significantly different from zero. On the other hand, social status of jobs matters only to males 
with foreign-born parents: A unit increase in the social status of the jobs changes the utility by about a 13% 
increase in the probability of enjoying coursework for these males. Earnings at the age of 30 are a significant 
determinant for males belonging to low-income families with foreign-born parents. 

In order to gain insight into the magnitude of these parameters. Table 11 shows the results of the decom- 
position methodology outlined in equation (9). Except for males with foreign-born parents, non-pecuniary 
attributes explain more than half of the choice. Eor individuals with US-born parents, more than two-thirds 
of the choice is driven by non-pecuniary motivations; the non-pecuniary outcomes at college are of utmost 
importance to this group. Eor individuals with foreign-born parents, pecuniary outcomes at the workplace 
are of greatest value in the choice for males, while non-pecuniary outcomes at college continue to be of utmost 
importance to such females. 

To recap the findings in this section, demographic characteristics bias preferences in favor of certain 
outcomes. Males with foreign-born parents are driven primarily by the pecuniary attributes when making 
their choice of college major, while the converse is true for all other groups. 



5.3 Parents’ Approval and Peer Effects 



Though section 5.2 shows that one channel through which parents’ approval matters is the individual’s 
reliance on them for college support, it is not clear which majors parents are more likely to approve or what 
criteria they use for approving a major. Since only the beliefs of students are observed, I can only study the 
relationship between students’ beliefs about parents’ approval of a major and their own beliefs about other 
outcomes associated with the choice.^® Controlling for the individual’s major, I regress respondent i’s beliefs 
about her parents’ approval for major j on her beliefs about the other outcomes associated with j. More 
specifically, I consider the following regression model: 



— 1) — + Aj -|- Q; Xij + /3 



Y,P^Jibc = l) + Y^E,Jid, 



+ £' 






c=l 

,c^4 



q=l 



^®It could be that parents have subjective beliefs about the outcomes that are very different from those of the student. However, I can 
only analyze the relationship the student believes exists between her expectation of parents’ approval and her subjective expectations of the 
various choice-specific outcomes. 
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where 5i is an individual fixed-effect, \j is a field- fixed effect, Xij is a vector of individual-specific controls, and 
/3 is the vector of interest. The results are presented in Table 12. Students’ beliefs about parents’ approval 
for a given major increase in their beliefs of finding a job upon graduation, enjoying work at potential jobs, 
and the social status of jobs. Expectation of parents’ approval for a major increases by nearly 3 points (on a 
scale of zero to 100) if the probability of finding a job upon graduation in that major increases by 10 points. 
This effect is even stronger for students with foreign-born parents: Students believe that switching to a major 
with a 10-point higher probability of getting a job upon graduation is likely to increase parents’ approval 
by nearly 5 points. A positive and significant effect, half in magnitude to that of finding a job, is found for 
the social status of the jobs. Again, the effect is stronger for students with foreign-born parents. The only 
other outcome that affects beliefs about parents’ approval is the expectation of enjoying work at the jobs for 
females. Another notable point is that, for females only, parents’ approval is higher by about 5 points for 
one’s chosen major. 

Males with foreign-born parents expect parental approval of a major to increase by about 12.5 points for a 
unit increase in the social status of the jobs. This result reconciles the earlier finding in section 5.2 of parental 
approval not mattering to males with foreign-born parents. Expectation of parents’ approval has a positive 
relationship with the perceived social status of jobs, and status of jobs is an important outcome only in the 
choice for males with foreign-born parents (column (2) in Table 10); hence, because of colinearity, approval 
of parents does not directly affect the choice of these individuals. 

A second important issue is the extent to which one’s choice of major is influenced by their parents and peer 
group. Several respondents in my survey report having majors that are the same as that of their roommates 
and friends. However, there is a self-selection issue: People often select with whom they associate (Manski, 
1993b). Since rooming assignments are not totally random at Northwestern and there are endogeneity issues 
in how friendships are formed, I cannot analytically study the strength of peer effects in the choice of college 
major. 

There seems to be a positive correlation between the students’ majors and those of their parents. In 
particular, students pursuing a major in Natural Sciences are more likely to have a parent who majored in 
that category. Of the 63 individuals with at least one sibling, 22 major in the same field as their sibling. A 
positive correlation between an individual’s choice of college major with that of her parents or siblings could 
be consistent with either (1) the student having more information about that particular choice by information 
acquisition of the various outcomes from her parents and siblings, and hence choosing that major through an 
indirect effect of parents; (2) direct parental pressure leading an individual toward a particular major choice; 
or (3) a utility gain by studying the same major as that of the parents. Eor example, Zafar (2008) finds 
evidence of the first: Individuals are less likely to make errors in reporting starting salaries of majors if their 
father pursued a major in that category. Moreover, when estimating preferences that incorporate individual 
heterogeneity in section 5.2, demographic characteristics (like birth country of the parents) are found to 
bias preferences for certain outcomes. However, it is not possible to identify the channel through which this 
is happening, i.e., whether beliefs and preferences are subconsciously being formed as a consequence of the 

There is little research on peer effects in crucial education-related decisions such as choice of college major. Sacerdote (2001) does not find 
evidence of peer effects in choice of major among Dartmouth College roommates. De Giorgi et al. (2007) ffnd that Bocconi undergraduates 
are more likely to choose a major when many of their peers make that choice. 
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individual’s interactions with parents, or whether parents are intentionally shaping the beliefs and preferences 
of their children (as in Bisin and Verdier, 2001), or both. Survey respondents were asked to explain the reasons 
for the similarity between their major and that of their parents and siblings. Selected responses are shown in 
section 9.2.1 of the Appendix. All three reasons come up as possible explanations. The responses also show 
instances of peer influence, but in most cases individuals seem to form friendships with similar individuals. 

5.4 Robustness Checks 

The model estimated in section 5.1 assumes that all individuals have homogeneous preferences for various 
outcomes. Individuals with different characteristics are very likely to have different preferences. Moreover, 
the assumption that the random terms {£ik\ are independent for every individual i and choice k might be very 
strong. Though a model with limited heterogeneity in preferences is estimated in section 5.2, any unaccounted 
or unobserved heterogeneity may bias the model estimates. In this section, I specify a random parameters 
logit model to account for these issues (see Revelt and Train, 1998, for a discussion of mixed logit models). 
One could allow heterogeneity in preferences for all outcomes, but I focus on the most important outcomes: I 
consider a model in which the differences in utility levels for graduating with a GPA of at least 3.5, enjoying 
the coursework, gaining approval of parents, enjoying work at the available jobs, and the parameter for social 
status of the available jobs are allowed to vary in the population with a specified distribution. The utility 
that individual i receives from choosing major m is: 

U{Xi, {Pim{br), Pim{dq)}r£{l,..,7},q£{l,..,4:}) 

— ^^r={l,5,7} d^imibr — 1) Alij. + Xys={2,3,4,6} — l)^'U'si 

+ S(j={l,2,4} 1 qddimidq) + ^ •^^Eim[d‘i) + £im 

where Xusi for s = {2, 3, 4, 6 } and 73 , are allowed to vary in the population according to a specified parametric 
distribution, and £im is an iid random term that is extreme value distributed. I denote the vector of parameters 
{Au 2 i, Ausi, Auii, AuQi, 73 j} by /3j, and the density of these parameters /(/3J0) where 0 are the parameters 
of the distribution. The probability of i choosing the major m conditional on is: 



Pr(77?.|/3j ) Pr(77?.|{Tj/,;(l)^), A^/i:(dg)},.^|X,..,7},Qe{l,..,4}; k^Ci^ ) 

6^P(X)r={l,5,7} Pim(br = l)AUr + X)s={2,3,4,6} Pirnibs = IjAUgi + X)g={l,2,4} 1 qPim{dq) + 'J 
5DfceCi ®^P(X)r={l,5,7} Pik{br = l)Artr + X)s={2,3,4,6} Piki^s = + X)g={l,2,4} 1 qPik{dq) + 'y^^En~{ds)) 

The unconditional probability of choosing m is the integral of this conditional probability over all possible 
values of /3j and depends on the parameters 6 of the distribution of /3j. The unconditional probability for i 
choosing m is: 

Pim{0) = J P^{rn\{Pik{br), Eik{dq)}r(z{i^..j},q(z{i^..,4y, k&Ci, fdi )/(/3i |0)d/3j 

This integral is approximated through simulation since it cannot be calculated analytically. For a given value 
of the parameter vector 0, a value of /3j is drawn from its distribution. Using this draw, I calculate the 
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conditional probability. This process is repeated for D draws, and the average is taken as the approximate 
choice probability: 

1 ^ 

Pim{9) = J^ Plc{m\{Pik{br), £'tA:(tiq)}rg{l,..,7},i7g{l,..,3}; k&Ci^ Pi) 

The log-likelihood function ln(Prj ) is approximated by the simulated log-likelihood function Yhi ^^iPi{(^))i 
and the estimated parameters are those that maximize the simulated log-likelihood function. I assume that the 
coefficients for graduating with a GPA of at least 3.5, enjoying the coursework, gaining the approval of parents, 
enjoying work at the available jobs, and valuing the social status of the available jobs are independently log- 
normally distributed.^^ The difference in utility levels for an outcome k that is assumed to vary in the 
population is expressed as Auk = exp{Auk + where fij. is a standard normal deviate. The parameters 

Auk and ak, which represent the mean and standard deviation of log(Attfc), are estimated. The mean and 

2 2 t 

standard deviation of Auk are exp(Attfc -|- and exp(Artfc + ^) * y (6xp(cr|) — 1), respectively. 

Columns (la)-(lc) in Table 13 present the estimates of the mixed logit specification for the model with 
D = 100, 000. Estimates of various outcomes are similar to those obtained in the corresponding model 
with no heterogeneity (column 3 of Table 5). The mean coefficient of enjoying coursework is still largest 
in absolute value and significant. The estimated standard deviations of the (random) coefficients are highly 
significant, indicating that these parameters do indeed vary in the sample. Standard deviations for coefficients 
of graduating in 4 years and social status of available jobs are especially very large, indicating that there is 
substantial heterogeneity in how these outcomes are valued in the sample (consistent with what was also 
found in the previous section). Another point of note is that the mean coefficients in the mixed logit model 
are larger than the corresponding fixed coefficients in Table 5. This is because, in the mixed logit, some of the 
stochastic portion of the utility is captured in (3^ rather than in £j. Since the utility is scaled so that e* has 
the variance of an extreme value, the parameters are scaled down in the standard model relative to the mixed 
logit model (the same result is obtained by Revelt and Train, 1998). The fact that the mean coefficients are 
bigger than the fixed coefficients implies that the random parameters constitute a large share of the variance 
in unobserved utility. 

One might wonder about the extent to which the variation in the parameters in the mixed logit model 
can be explained by including demographic characteristics. Columns (2a) through (2c) in Table 13 present 
estimates of the mixed logit model with demographic variables that were used in the heterogeneous model 
described in section 5.2. The estimates are similar to those in column (1) of Table 10, though they are larger 
in magnitude, which is expected. The standard deviations are still large and significant, which indicates that 
the demographic variables considered in section 5.2 capture only some of the heterogeneity exhibited by the 
individuals. Nonetheless, the fact that the relative magnitude of the estimates is similar to previous results 
is reassuring. 



use a log-normal distribution instead of a normal distribution for these parameters since these are all outcomes that one would expect 
to be desirable to an individual. The normal distribution allows coefficients of both signs and implies that some share of the sample has 
negative coefficients for those outcomes, whether or not it is true. The log-normal assumption ensures that each respondent in the sample 
has a positive coefficient for these outcomes. 
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6 Double-Major Choice Model 



Two main reasons emerge for why students may pursue two majors: First, two majors appropriately differen- 
tiated can provide a broader mix of options than a single major. Second, it might be the case that no single 
major meets the needs of the individual. For example, an individual might be interested in both maximizing 
her income prospects and enjoying the coursework. It could very well be the case that no single major meets 
her needs, but a combination of two majors does. To capture the enhanced options and specialization of 
function that two majors provide, I estimate a separate choice model for double majors. More specifically, I 
assume that the utility of a pair of majors depends on the attributes of each major separately, as well as on 
the attributes of a composite major combining the best of both majors. 

Discussion of the model and estimation can be found in the online Appendix at the author’s webpage. I 
only summarize the main findings here: Double-major individuals have preferences similar to those with single 
majors. Graduating in 4 years, enjoying coursework, and gaining approval of parents are the most important 
outcomes in the choice of a major pair. There is evidence that individuals prefer to choose pairs of majors 
that differ in their chances of graduating in 4 years. Females and males differ in the outcomes they specialize 
in. Females choose major pairs that offer different chances of finding a job, while males choose major pairs 
that are different in gaining the approval of parents and enjoying coursework. On the whole, students with 
double majors pursue their interests at college while taking into account parents’ approval, and they also act 
strategically in their choices by choosing majors that differ in their chances of completion and in finding a 
job upon graduation. 

7 Understanding Gender Differences 

Sections 5 and 6 show that males and females also differ in their preferences for the various outcomes. 
The descriptive analyses in section 3.4 and in Zafar (2008) document the heterogeneity in beliefs for various 
outcomes between the two genders. Though the results of the decomposition metric of equation (9), presented 
in Tables 6 and 11, highlight the gender differences in preferences, it is not clear how much of the gender 
gap in the choice of college majors is driven by differences in preferences and how much is due to differences 
in distributions of subjective beliefs. This distinction is important, since males and females identical in 
their preferences will make different career choices if there are gender differences in beliefs about success in 
different occupations (Breen and Garcia-Penalosa, 2002). Moreover, any policy recommendations will depend 
on whether the gender gap exists because of innate differences or because of social biases and discrimination. 
For example, if the gender gap existed because of gender differences in beliefs about ability and self-confidence, 
then policy interventions like single-sex classes could possibly reduce the gap. In this section, I delve into the 
underlying causes for the gender gap in more detail. 

7.1 Decomposition Analysis 

As a first step, I decompose the gender gap into gender differences in beliefs and preferences. A common way 
to explore differences between groups in a linear framework is to express the difference in the average value 
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of the dependent variable Y as: 



Ym-Yf = [{Xm - Xf)Pm] + [Xf0m - Pf)] 

where Xj is a vector of average values of the independent variables and /3j is a vector of the estimated 
coefficients for gender j G {{M)ale, {F)emale}. The first term on the right-hand side is the inter-group 
difference in mean levels of the outcome due to different observable characteristics, while the second term is 
the difference due to different effects of the characteristics. This technique is attributed to Oaxaca (1973). 
However, in the current context, the probability of choosing a given major, Y, is non-linear. In the case 
Y = F{X(5) and F{.) is a non-linear function, Y does not equal F(Xf3). The gender difference in this 
non-linear case can be written as: 

V _ V — _L IS^^F F'jXFilSM) _ SF^F F{Xpi!3p) -\ 

M ip — lz^i=i Mm 2^i=l Np i LZ^i=l Np X,i=l Np i 



= [F{XmPm) - F{XfPm)] + - HXfPf)] 

where Nj is the sample size of gender The first expression in the square brackets represents part of the 
gender gap that is due to gender differences in distributions of X, and the second expression represents the 
part due to differences in the group processes determining levels of T. It is relatively simple to estimate the 
total contribution. However, identifying the contribution of group differences in specific variables/coefficients 
to the gender gap is not straightforward. For this purpose, I use a decomposition method proposed by Fairlie 
(2005). Contributions of a single variable/coefficient are calculated by replacing the relevant variable of one 
group with that of the other group sequentially, one by one. For illustration, suppose Yj = F(Xjl3j) for 
j={F,M} and that X includes two variables, X\ and X 2 - Moreover, let Nm = Xf = N and assume there 
exists a natural one-to-one matching of female and male observations. The independent contribution of X\ 
to the gender gap is given as: 
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N 



N 



F{XiMiPlM T X2Mil^2M) ~ ^{XlFiPlM T X2Mil^2M) 



2 = 1 



and that of X 2 is given as: 



N 



N 



F{XlFiPlM + X2Mil^2M) ~ ^iXlFiPlM + X2Fi^2M) 



i=l 

Therefore, the contribution of a variable to the gap is equal to the change in the average predicted prob- 
ability from replacing the female distribution with the male distribution of that variable while holding the 
distributions of the other variable constant. One important thing to note is that, unlike in the linear case, 
the independent contributions of X\ and X 2 depend on the value of the other variable. Therefore, the order 
of switching the distributions can be important in calculating the contribution to the gender gap.^^ Similarly, 



equally valid expression is Y m — y F = [F{XmI^p) — F(Xp(3p)] + T(^m/3m) “ F(XmI^p)]- This alternative method provides 
different estimates, which is the familiar index problem with the Oaxaca decomposition technique. 

^^Yun (2004) outlines an alternate decomposition strategy that is free from path-dependency. The method is easier to implement, but I 
don’t use it since it involves a first-order Taylor approximation. Moreover, I believe that the decomposition employed in this paper is closer 
to what is standard in the literature. 
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the independent contribution of (5i to the gap is given by: 
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N 
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+ ^2Fif^2M) ~ ^'i^lFiPlF + ^2Fi!^2M) 

i=\ 



and that of /?2 is given as:^^ 



F{X\FililF + ^2FiI32m) ~ F{XlFiPlF + X2Fil^2F) 

i=l 

For the purposes of this decomposition, I treat double-major respondents as if they were pursuing a single 
major; I use the parameter estimates obtained from the single major choice model using stated preferences 
of the respondents. Results of this decomposition are presented in Table 14 for four different majors.^® The 
last row of the table shows that both expectations and preferences contribute to the gender gap for all major 
categories. The contributions of preferences and beliefs to the gap differ by fields. The majority of the gender 
gap in Literature & Fine Arts and in Social Sciences II is due to gender differences in beliefs, while gender 
differences in preferences explain most of the gap in Engineering and in Social Sciences I. 

If women being less overconfident than men (Niederle and Vesterlund, 2007, and references therein) and 
women being low in self-confidence (Long, 1986; Valian, 1998) were the main explanations for the underlying 
gender gap, one would expect gender differences in beliefs about academic ability to be important in explaining 
the gender difference in major choices. However, columns (l)-(4) of Table 14 show that gender differences in 
beliefs about ability (more precisely, beliefs about graduating in four years and graduating with a GPA of at 
least 3.5) are insignificant and explain a small part of the gender gap. Therefore, explanations based entirely 
on the assumption that women are under-represented in sciences and Engineering because they have lower 
self-confidence can be rejected in my data. Another striking observation is that gender differences in beliefs 
about enjoying coursework in the various fields are significant and explain a large part of the gap. 

Here I discuss the decomposition results for Engineering in some detail. These results are presented in 
columns (1) and (5) of Table 14. The model predicts that, on average, males are nearly twice as likely as 
females to major in Engineering (an average male probability of 0.104 versus 0.045 for females); 60% of this 
gap is due to gender differences in preferences for various outcomes. Moreover, nearly 27% of the gap is due 
to gender differences in beliefs about enjoying coursework. Interestingly, gender differences in beliefs about 
future earnings are insignificant and constitute less than 0.5% of the gap. Eemales have beliefs similar to 
those of males about academic ability in Engineering.^® These findings suggest that females are less likely to 
major in engineering not because they are under-confident about their academic ability, low in self-confidence, 
or believe wage discrimination exists in the labor market. Instead, it is because they believe that they won’t 



this illustration, I have assumed an equal number of observations for females and males. However, my sample has more females than 
males. Since the decomposition requires one-to-one matching of female and male observations, I use the following simulation process. From 
the female sub-sample, I randomly draw 60 samples with the same number of observations as in the male sub-sample. Then I sort the female 
and male data by the predicted probabilities and calculate separate decomposition estimates. The mean value of estimates from the separate 
decompositions is calculated and used to approximate the results from the entire female sample. As in Fairlie (2005), I approximate the 
standard errors using the delta method. 

do not conduct this analysis for the category of Natural Sciences. This is because the category pools both life sciences and physical 
sciences. Traditionally, females are more likely to major in the former and less likely to major in the latter. Since I pool them together, the 
decomposition analysis for the pooled category would not be very useful. 

observe only the beliefs about academic ability, not actual academic ability. However, Chemers et al. (2001) show that confidence 
in one’s ability is strongly related to academic performance. Moreover, it is the beliefs that matter when an individual is making a choice 
under uncertainty. 
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enjoy taking courses in Engineering. 



7.2 Simulations 

This section simulates different environments to see how the gender gap would change under different scenarios. 
Column (1) of Table 15 shows the gender gap predicted by the model for the various major categories. The 
simulation in column (2) considers an environment where the female subjective ability distribution (beliefs 
about graduating within four years and about graduating with a GPA of at least 3.5) is replaced with that 
of males. The purpose of this simulation is to determine how much of the gap is due to females having less 
self-confidence in their ability (relative to men). The second simulation in column (3) replaces the female 
subjective earnings distribution with that of males; it is meant to answer the question of how much of the gap 
is due to beliefs of wage discrimination in the labor market. Columns (4) and (5) simulate an environment 
in which females have the same beliefs as males about enjoying coursework and enjoying work at potential 
jobs, respectively. 

I continue to focus the discussion on Engineering. The results confirm the findings in Table 14. If female 
expectations about ability were raised to the same level as those of males through some policy intervention, 
the gender gap in Engineering would decrease by less than 14%. The gender gap virtually stays the same if 
female expectations of future earnings were forced to be the same as those of males. Einally, the gender gap 
decreases by nearly 50% if the female beliefs about enjoying coursework in Engineering were replaced with 
those of males. These results are in line with the findings of the previous section. The small contribution of 
gender differences in beliefs about ability and future earnings in Engineering toward the underlying gender 
gap in the choice of major allows me to rule out low self-confidence and perceived wage discrimination in 
the labor market as possible explanations for why women are less likely to major in fields like Engineering. 
However, it is not clear what kind of policy would be able to bring about a change in female beliefs about 
enjoying coursework and enjoying working at the jobs because these gender differences could be a consequence 
of innate gender differences in attitudes (Baron-Cohen, 2003) or due to social biases including discrimination 
(Valian, 1998).^® This issue is pursued in more detail in the following section. 

7.3 Understanding Beliefs Abont Enjoying Conrsework and Work 

In a quest to understand why females are less likely to enjoy studying and working in fields like Engineering, 
survey respondents were asked their beliefs about each gender being treated poorly at the jobs that would be 
available in the different major categories. The question was worded as follows^®: 

" What do you think is the percent chance that X (where X = {Male, Female}) would be treated 
poorly in jobs that are available in each of the following fields?" 



sort the female and male sub-samples according to the predicted probability of majoring in that field and then replace the female 
subjective belief about ability with that of the corresponding male. Since there are more females than males, I use a simulation method 
similar to the one used for the Fairlie decomposition. 

^^An example of the latter is that women might believe that these fields are not gender-neutral but constructed in accordance with 
the traditional male role, and that they therefore would be treated poorly in the workplace. For example, Traweek (1988) argues that an 
aggressive behavior is a necessary ingredient for achieving success in science, and Niederle and Vesterlund (2007) show that women tend to 
shy away from competitive environments. In that case, even if women perceive no gender difi'erence in ability and compensation, their beliefs 
about how much they will enjoy studying engineering and science will be affected. 

^^This question was asked in the follow-up survey (Zafar, 2008), which was taken by 117 of the 161 original survey takers. 
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Before providing the responses to this question, columns (1) and (2) of Table 16 report the fraction of 
females that survey respondents believe take classes in the various majors. Column (3) reports the average 
number of females who graduated in the various majors in 2005 and 2006 (source: IPEDS 2005 and IPEDS 
2006). Survey respondents seem to be well informed about the relative fraction of females in the various 
majors. The responses to the question about males and females being treated poorly are shown in columns 
(4)-(7) of Table 16. Several notable patterns stand out. Eirst, male respondents believe that females are 
treated more poorly than males in jobs in all fields except Education, Literature & Eine Arts, and Music 
Studies; these three fields correspond to the three most female-dominated fields (in college) as reported by 
males in column (1) of the table. Second, females believe that they would be treated more poorly than males 
at jobs in all fields except Education- the field that females believe has the highest fraction of females. Third, 
for both the male and female respondents, the largest difference in females being treated poorly relative to 
males is for Engineering and Math & Computer Sciences- two categories with the lowest fraction of females 
(as reported by both males and females). Einally, both males and females believe that Education is the 
category in which males would be treated the worst. 

Table 17 shows a significant correlation of -0.35 between females’ beliefs about being treated poorly at 
the jobs and the fraction of females in the major’s classes. The table also shows separate male and female 
correlation patterns between the variables described in Table 16 and beliefs about enjoying coursework and 
enjoying work at jobs. These correlations indicate that, for female students in particular, beliefs about 
enjoying coursework and enjoying work at the jobs are positively related to beliefs about the fraction of 
females taking classes in that field and negatively correlated with the perceptions of females being treated 
poorly in the jobs.'^*' Interpreting these correlations is not straightforward. It could be that females prefer fields 
that value female-specific attributes and where females are treated more favorably (Cejka and Eagly, 1999, 
find that occupations that are female-dominated are those where female-specific attributes are perceived to 
be essential for success), or it could be that females are treated more favorably at those jobs precisely because 
those are "female" occupations. Unfortunately, with the available data, it’s not possible to choose between 
these competing causal explanations. 

I re-estimate the single-major choice model initially estimated in section 5.1 to see how the inclusion of the 
new variables "females treated poorly at the jobs " and "males treated poorly at the jobs " affect the parameter 
estimates. The estimates for the determinants that were already included in the initial model stay almost the 
same, while the new variables are insignificant.^^ Moreover, the new variables do not improve the model’s 
explanatory power for the entire sample and for females; relative to the initial model, the Wald measure 
of goodness-of-fit that compares the likelihood ratio chi-squared of the model to one with the null model) 
does not change by much. 

The findings in this section suggest that females are less likely to major in fields like engineering not because 
they are under-confident about their academic ability, low in self-confidence, or believe wage discrimination 

^"individuals were asked to explain what "being treated poorly" meant to them. See section 9.2.2 for selective comments. 

Since beliefs of males and females being treated poorly at the jobs are strongly correlated with beliefs about enjoying coursework and 
enjoying work at the jobs (Table 17), it seems that their impact on the choice is already being captured indirectly. Indeed, the variable 
"females treated poorly at the jobs" only shows up significantly (at the 1% and 10% level, respectively) for females and the entire sample in 
a model that excludes both enjoying coursework and enjoying work at the jobs. Model estimates are available upon request. 
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exists in the labor market. Instead, it is because they believe that they won’t enjoy taking courses in 
Engineering. In other words, it’s not that women think they won’t be good engineers; it’s that they think 
they won’t enjoy studying it. The results seem to suggest that a policy that changes social attitudes might 
be more useful in narrowing the gap. 

A question that I do not address is the source of gender differences in preferences, which could arise from 
differences in tastes, as well as gender discrimination. For example, parents who know that females would 
be discriminated against in male-dominated majors/occupations could try to shape the preferences of their 
female children so that they are more comfortable in female-dominated majors/occupations (Altonji and 
Blank, 1999). The question of understanding the sources of gender differences in preferences is beyond the 
scope of this paper. 

8 Conclusion 

Choosing a college major is a decision that has significant social and economic consequences. Little is known 
about how youth choose college majors and why the observed gender gap exists. In this paper, I estimate a 
model of college major choice with a focus on explaining the gender gap. Gender differences in major choice 
are extremely complex, and no simple explanation can be provided for them. The analysis presented in this 
paper attempts to enhance our understanding of these issues. 

On the methodology side, this paper shows that elicited expectations can be used to relax strong and often 
nonverifiable assumptions about expectations to infer decision rules under uncertainty. Descriptive analysis of 
the subjective data shows substantial heterogeneity in beliefs both within and between genders. My approach 
also differs from the literature on major choice by accounting for both the pecuniary and non-pecuniary 
determinants of the choice. I have shown that elicited subjective data can be used to infer decision rules in 
environments where expectations are crucial. This is particularly relevant in cases where the goal is to explain 
group differences in choices under uncertainty and where expectations may differ across groups (in unknown 
ways). 

I estimate models for single-major and double-major choice. Outcomes most important in choice of 
major are enjoying coursework, gaining approval of parents, and enjoying work at the jobs. Non-pecuniary 
determinants explain about half of the choice for males and more than three- fourths of the choice for females. 
Males and females have similar preferences regarding choices at college, but differ in their tastes regarding 
the workplace; females mostly care about non-pecuniary outcomes (gaining approval of parents and enjoying 
work at jobs), while males value pecuniary outcomes (social status of the jobs, likelihood of finding a job, and 
earnings profiles at jobs) more. In addition, I find that students choosing double majors hedge their chances 
of getting a job upon graduation and completing their studies by choosing pairs of majors that differ in these 
two outcomes. Cultural proxies and demographic variables bias beliefs and preferences in systematic ways. 

Another concern that is not directly addressed in this paper is the extent to which beliefs are affected by preferences. In particular, 
cognitive dissonance may cause individuals to report beliefs that are consistent with their choices. One needs to see how beliefs evolve over 
time in order to study this. This issue is studied in detail in Zafar (2008). There are three ways that I check for this: (1) compare beliefs 
with objective measures in cases where it is possible (for example, future income), and see if individuals self-enhance beliefs for outcomes 
associated with their intended major; (2) since some individuals had officially declared their majors while others had an intended major at 
the time of first survey, systematic differences in estimates for the two sub-samples would indicate presence of cognitive dissonance; and (3) 
analyze how individuals revise their beliefs for outcomes associated with the different majors- this requires a panel of beliefs and provides 
the most convincing evidence on the presence of any biases. All three approaches reveal that cognitive issues do not affect the way in which 
individuals report their beliefs. 
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Individuals with foreign-born parents value the pecuniary determinants of the choice more than individuals 
with US-born parents. Indeed, males with foreign-born parents are the only sub-group in my sample who 
value pecuniary determinants more than the non-pecuniary outcomes. 

The analysis in this paper has some limitations. First, the study is based on data from Northwestern 
University only. The heterogeneity in subjective expectations underscores the need to elicit similar data at 
different undergraduate institutions and at a larger scale in order to make policy recommendations. Second, 
heterogeneity in subjective responses could be driven by differential access to information or by different 
information processing. Progress in understanding how people form and update expectations requires richer 
longitudinal data. Moreover, as Manski (2004) argues, understanding expectations formation will also require 
intensive probing of individuals to learn how they perceive environments and how they process new informa- 
tion. Third, individuals may find it optimal to experiment with different majors to learn about their ability 
and match quality (Altonji, 1993; Malamud, 2006; and Stinebrickner and Stinebrickner, 2008). This study 
does not focus on this aspect, assuming instead that individuals maximize current expected utility. 

My results shed some light on the reasons for the gender gap in college major choice. Gender differences in 
beliefs about ability and future earnings are insignificant in explaining the gender gap. A policy intervention 
that were to raise the expectations of females about ability and future earnings in engineering to the same 
level as those of males would decrease the gender gap only by about 15%. This result has two implications: 
(1) just raising the expectations of women may not be enough to eradicate the gap, and (2) hypotheses that 
claim that the gap could be explained by women having low self-esteem and being less overconfident than 
men can be rejected by my data. Most of the gender gap is due to gender differences in beliefs about enjoying 
coursework and different preferences for various outcomes- simply replacing females’ beliefs about enjoying 
coursework with those of the males decreases the gender gap in engineering by almost half. Gender differences 
in beliefs about enjoying coursework as well as in preferences may exist because of differences in tastes or 
because of gender discrimination. Richer data are needed to answer this question. I believe the next natural 
step is to explore how individuals form beliefs. 
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9 Appendix 

9.1 Survey Excerpt 

Practice Questions In some of the survey questions, you will be asked about the PERCENT CHANCE of something 
happening. The percent chance must be a number between zero and 100. Numbers like 2 or 5% indicate “almost no 
chance,” 19% or so may mean “not much chance,” a 47 or 55% chance may be a “pretty even chance,” 82% or so 
indicates a “very good chance,” and a 95 or 98% mean “almost certain.” The percent chance can also be thought of as 
the NUMBER OF CHANCES OUT OF 100. 

We will start with a couple of practice questions. 

1. PRACTICE QUESTION 1: What do you think is the PERCENT CHANCE (or CHANCES OUT 

OF 100) that you will eat pizza for lunch next week? % 

2. PRACTICE QUESTION 2: What do you think is the PERCENT CHANCE (or CHANCES OUT 

OF 100) that you will eat pizza for lunch on Tuesday next week? % 
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Once students had answered the questions, they were given the following instructions 

Note that “pizza for lunch next week” INCLUDES the possibility of “pizza for lunch on Tuesday next week”. Recall 
that: 

PRACTICE QUESTION 1: What do you think is the PERCENT CHANCE (or CHANCES OUT OF 100) that 
you will eat pizza for lunch next week? 

PRACTICE QUESTION 2: What do you think is the PERCENT CHANCE (or CHANCES OUT OF 100) that 
you will eat pizza for lunch on Tuesday next week? 

Since “pizza for lunch next week” INCLUDES the possibility of “pizza for lunch on Tuesday next week”, your answer 
to PRACTICE QUESTION 2 should be SMALLER or EQUAL than your answer to PRACTICE QUESTION 1. 

9.1.1 Questionnaire 

The following set of questions was asked for each of the relevant categories. The questions below were asked for Natural 
Sciences. 

Q1 If you were majoring in Natural Sciences, what would be your most likely major? 

Q2 If you were majoring in Natural Sciences, what do you think is the percent chance that you will successfully 
complete this major in 4 years (from the time that you started college)? (Successfully complete means to complete 
a bachelors) 

NOTE: In answering these questions fully place yourself in the (possibly) hypothetical situation. For example, for 
this question, your answer should be the percent chance that you think you will successfully complete your major in 
Natural Sciences in 4 years IF you were (FORCED) to major in it. 

Q3 If you were majoring in Natural Sciences, what do you think is the percent chance that you will graduate with a 
GPA of at least 3.5 (on a scale of 4)? 

Q4 If you were majoring in Natural Sciences, what do you think is the percent chance that you will enjoy the 
coursework? 

Q5 If you were majoring in Natural Sciences, how many hours per week on average do you think you will need to 
spend on the coursework? 

Q6 If you were majoring in Natural Sciences, what do you think is the percent chance that your parents and other 
family members would approve of it? 

Q7 If you were majoring in Natural Sciences, what do you think is the percent chance that you could find a job (that 
you would accept) immediately upon graduation? 

Q8 If you obtained a bachelors in Natural Sciences, what do you think is the percent chance that you will go to 
graduate school in Natural Sciences some time in the future? 

Q9 What do you think was the average annual starting salary of Northwestern graduates (of 2006) with Bachelor’s 
Degrees in Natural Sciences? 

Now look ahead to when you will be 30 YEARS OLD. Think about the kinds of jobs that will be available for you 
and that you will accept if you successfully graduate in Natural Sciences. 

NOTE that there are some jobs that you can get irrespective of what your Field of Study is. For example, one could 
be a janitor irrespective of their Field of Study. However, one could not get into Medical School (and hence become a 
doctor) if they were to major in Journalism. 

Your answers SHOULD take into account whether you think you would get some kind of advanced degree after your 
bachelors if you majored in Natural Sciences. 

QIO What kind of jobs are you thinking of? 

Qll Look ahead to when you will be 30 YEARS OLD. If you majored in Natural Sciences, what do you think is the 
percent chance that you will enjoy working at the kinds of jobs that will be available to you? 

Q12 Look ahead to when you will be 30 YEARS OLD. If you majored in Natural Sciences, what do you think is the 
percent chance that you will be able to reconcile work and your social life/ family at the kinds of jobs that will 
be available to you? 

Q13 Look ahead to when you will be 30 YEARS OLD. If you majored in Natural Sciences, how many hours per week 
on average do you think you will need to spend working at the kinds of jobs that will be available to you? 

When answering the next two questions, please ignore the effects of price inflation on earnings. That is, assume that 
one dollar today is worth the same as one dollar when you are 30 years old and when you are 40 years old. 
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Q14 Look ahead to when you will be 30 years old. Think about the kinds of jobs that will be available to you and 
that you will accept if you graduate in Natural Sciences. What is the average amount of money that you think 
you will earn per year by the time you are 30 YEARS OLD? 

Q15 Now look ahead to when you will be 40 years old. Think about the kinds of jobs that will be available to you and 
that you will accept if you graduate in Natural Sciences. What is the average amount of money that you think 
you will earn per year by the time you are 40 YEARS OLD? 



9.2 Debriefing 

9.2.1 Peer Effects 



The question was: 

Check all that apply 

1) My (intended) major 

2) My (intended) major 

3) My (intended) major 

4) My (intended) major 

5) My (intended) major 

6) My (intended) major 
1) None of the above 



is 


the 


same 


as 


that 


is 


the 


same 


as 


that 


is 


the 


same 


as 


that 


is 


the 


same 


as 


that 


is 


the 


same 


as 


that 


is 


the 


same 


as 


that 



of one of my parents 
of one of my siblings 
of my freshman-year roommate 
of my current roommate 

of the majority of my best high school friends who went to college 
of the majority of my friends in Northwestern 



Next the respondent was asked: "For each of the options ( 1 through 6) in Question 5 that you have marked, please 
explain the underlying reason for it " 

Some of the selected responses are: 

• I am influenced by my father but not much by friends. 

• My Integrated science major is the same as the majority of my friends, because most of the classes that I take is 
with Integrated science majors. Since we are in class together all the time, we have become good friends. 

• My brother is majoring in Journalism but also Political Science. This played a minor influence on my decision but 
is mostly coincidence that we like the same sort of classes. My freshman year roommate was possibly an influence 
on me, but we generally had the same interests in terms of school subjects from the start. 

• My dad majored in English, is passionate about the subject and is now a college professor who teaches it. He 
loved it, but it was never forced on me, resulting in that i grew to love it as well. And I’m good at it. When 
you’re constantly being grammatically corrected and pushed to think loftier ideas then it kind of becomes second 
nature, a permanent habit. As far as my freshman year roommate, i lived in the Communications Residential 
College. It’s 80% journalism and 19% theater. It was bound to happen. 

• My brothers and I have very similar interests and strengths. 

• My parents have always encouraged me to do well in school, and placed an emphasis on math and the sciences. 
Also, I live in a town of only 20,000 people, but there are two major research facilities in the town. Many of my 
peers were also children of scientists. I have a twin brother who also goes to Northwestern and studies Chemistry 
and German. We probably influenced each other because we’re very close. We both took the German AP, which 
is why both of us have German as a second major (the German major is relatively light, especially if you come in 
already taking third year classes). 

• I am interested in Psychology, and although my parents are not too keen on me studying psychology, that’s what 
I want to to. My mom was also interested in Psych, but she never perused it 

• My major is the same as my parents purely by coincidence. Somehow our interests coincide. My major is the 
same as the majority of my high school friends (but most of my best friends are doing medicine) because most of 
my high school friends who study abroad chose economics. It is also the major which most students from Hong 
Kong would choose when they study abroad since most jobs you can find back home is econ-related. My major is 
the same as the majority of my friends in Northwestern because 1) Economics is a popular major, the probability 
that you can hnd an econ major student is quite high 2) I met most of my friends and formed the friendship 
through classes and extracurricular activities. 

• My mom is a psychologist, and even though I have no desire to pursue that career I think she might have influenced 
my interest in psychology 
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• I grew up in a household where my parents are both scientists so I became interested in medicine and science 
simultaneously. They never told me what to do, it was just a matter of spending more time around a certain field. 
Also, I live on North Campus where a majority of Northwestern science majors and engineers live so it just so 
happens that many people are in the same field that I intend to be in, primarily by location because the dorms 
up North are closer to Tech, which is where most of our classes are held. 

• 1) Parental Influence 5 and 6) Social Integration with Friends of Similar Background 

• For the first, my parents raised me and my siblings, and for the second, I tend to make friends with people I share 
classes with. 

• I think they paired me with a roommate with whom I had stuff in common. My friends at Northwestern and I 
have the same interests and personalities and that is reflected in our majors. 

• My roommate took a Psychology class last year and really enjoyed it. I had never had any exposure to Psychology 
classes in high school, so decided that it would be interesting to take. I took the class this fall, and really enjoyed 
it. 

• My parents and I have similar tastes and I like the things they like. My roommate and I were best friends from 
high school and had very similar interests. 

• I think I want to major in economics because I see how successful my dad is today and since he majored in 
business, I thought economics would be close enough. 

• economics is something that flows for me when i learn it, maybe it’s in my genes since my dad majored in it during 
graduate school, it’s also very practical and covers many bases, so i see why my friends picked it, it’s respected, 
it’s not seen as a slacker major like psychology, and i find it very interesting as i would hope many people do since 
it’s such a popular major 

• I really think it’s a coincidence. My roommate is interested in politics, too. Maybe it’s because we’re from similar 
places. We’re both from coastal cities, where politics is big. 

• My father has influenced me indirectly because he is an economics professor. My brother is young and wants to 
follow me into business, i am friends with a lot of people in my classes, which happen to be econ./MMSS classes 

• My mother is terrible at math so she majored in an all-words major. Sociology, but I am OK at math so my Social 
Policy major incorporates a bit more economic reasoning and logic than hers 

9.2.2 Beliefs of Being Treated Poorly 

Below I present selective comments by gender to the question: What does the phrase "treated poorly in jobs available... " 
mean to you 

Females reported: 

• Women might be subject to jokes in strongly male-dominated fields, but men are more likely to be subject to worse 
treatment by female coworkers in strongly female-dominated fields. Poor treatment of women by men is much less socially 
acceptable than the reverse. 

• It might mean that they were treated unfairly in terms of pay, or it might mean that the demands of the job didn’t allow 
the individual to pursue his/her home/family life. 

• Discriminated against in terms of salary, opportunities, and promotions 

• I consider "treated poorly" to signify the chance of some form of gender discrimination present in a field (obviously would 
differ depending on what major/job from each field of study) 

• Openly discriminated upon, and thought to be less capable or incapable of doing the work 

• Looked down upon, not given respect, given bad hours, given bad assignments (women are often treated poorly more often 
than men.) 

• Having employers expect less of them and give them less responsibility, or outright discrimination. 

• Not as commonly appear in the field as the other sex 

• Getting less money; not being socially accepted in that job; not having the same opportunities for promotion 

• discrimination, not given the opportunity to do things because of their gender, stereotyped be the jobs are usually occupied 
on the other gender. 
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• Preferred less when in competition with someone equally qualified; made fun of for work; must face large gender imbalance 

• It means being treated differently based on gender, as in coworkers’ attitudes towards you, (how seriously they take you) 
or even employer’s treatment (like pay difference, or job expectations) 

• Glass ceiling; lower expectations 

Males reported: 

• To me this means the employee is not treated fairly or simply does not feel comfortable in the work environment. 

• Discrimination in salary and at the workplace, acceptance at workplace, promotion, acceptance into societies and journals, 
respect. 

• Harassment /mistreatment from coworkers and unfair compensation when compared to the opposite sex 

• It means that their employers will be biased in some way or another against them because of their gender, and will show 
it via some negative remark or action. 

• Treated with disrespect or made to work very long hours. Or not given a fair chance for promotion. 

• If a member of the opposite gender is given a higher position than you when you are more qualified for the position. 

• Managers/supervisors have prejudices against the work done by members of a certain gender, judging it unnecessarily 
harshly. 

• Not paid what they’re worth, not given ample opportunity for advancement, discriminated against in hiring, not having a 
job that adequately takes in to account a family life and life outside of work 

• To me it means discriminated against based on gender through different means such as interaction, salary, and respect. 

• People may think, "She’s a woman, she can’t solve these types of problems." 

• Jobs in which the individual is assumed to be less capable than they really are. Jobs which a small percentage of people 
think an individual of that gender shouldn’t be doing. 

• Gender discrimination based on expectations of abilities by gender (like bias against females on engineering and natural 
sciences) 



35 




Gender-Specific Enrollment Distribution 
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Figure 1: Gender Composition of Majors of 1999-2000 Bachelor’s Degree 
Recipients Employed Full-Time in 2001. 
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Figure 2: Average income of 1999-2000 Bachelor’s Degree Recipients Employed 

Full-Time in 2001 by Major. 
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The following is the classification of majors 
into categories: 



Table 1: List of Majors 

h Music Studies ^ 

Jazz Studies 
Music Cognition 
Music Composition 
Music Education 
Music Technology 
Music Theory 
Musicology 
Piano Performance 
String Performance 
Voice and Opera Performance 
Wind and Percussion Performance 



a Natural Sciences 
Biological Sciences 
Chemistry 

Environmental Sciences 
Geography* 

Geological Sciences 
Integrated Science 
Materials Science 
Physics 

b Mathematical and Gomputer Sciences 
Cognitive Science 

Computing and Information Systems 
Mathematics 
Statistics 

c Social Sciences I 
Anthropology 
Gender Studies* 

History 
Linguistics 
Political Science 
Psychology 
Sociology 

d Social Sciences II 
Economics 

Mathematical Methods in Social Sciences* 

e Ethics and Values 
Legal Studies* 

Philosophy 

Religion 

Science in Human Gulture* 

f Area Studies 
African American Studies 
American Studies 

Asian & Middle East Languages & Civilization 
European Studies 
International Studies* 

Slavic Languages and Literatures 

g Literature and Fine Arts 
Art History 

Art Theory and Practice 
Classics 

Comparative Literary Studies 

Drama 

English 

French 

German 

Italian 

Spanish 



i Education and Social Policy ^ 

Human Development and Psychological Services 
Learning and Organizational Change 
Secondary Teaching 
Social Policy 



L Journalism® 
Journalism 



* Adjunct majors. These do not stand alone 

1 Majors in the School of Music 

2 Majors in the School of Education and Social Policy 

3 Majors in the School of Communication 

4 Majors in the McCormick School of Engineering 

5 Majors in the Medill School of Journalism 



j Communication Studies® 
Communication Studies 
Dance 

Human Communication Science 
Interdepartmental Studies 
Performance Studies 
Radio/Television/ Film 
Theatre 

k Engineering"^ 

Applied Mathematics 
Biomedical Engineering 
Chemical Engineering 
Civil Engineering 
Computer Engineering 
Computer Science 
Electrical Engineering 
Environmental Engineering 
Industrial Engineering 
Manufacturing and Design Engineering 
Materials Science& Engineering 
Mechanical Engineering 
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Table 2: Sample Characteristics 







Sample 






Population® 


Characteristics 


All 

Preq. (Percent) 


Single Majors 
Freq. (Percent) 


Double Majors 
Preq. (Percent) 


Preq. (Percent) 


Gender 


Male 


69 (43) 


33 (40) 


36 


(46) 


465 (46) 


Female 


92 (57) 


50 (60) 


42 


(54) 


546 (54) 


Total 


161 


83 


78 




1011 


Ethnicity 


Caucasian 


79 (49) 


40 (48) 


39 


(50) 


546 (54) 


African American 


11 (7) 


7 (8.5) 


4 


(5) 


71 (7) 


Asian 


56 (35) 


27 (33) 


29 


(37) 


232 (23) 


Hispanic 


5 (3) 


2 (2) 


3 


(4) 


61 (6) 


Other 


10 (6) 


7 (8.5) 


3 


(4) 


101 (10) 


Declared Major?^ 


Yes 


90 (56) 


44 (53) 


46 


(59) 


182 (18) 


No 


71 (44) 


39 (47) 


32 


(41) 


829 (82) 


International Std?° 


Yes 


8 (5) 


5 (6) 


3 


(4) 


40 (4) 


No 


153 (95) 


78 (94) 


75 


(96) 


971 (96) 


Second-Gen Imm?'^ 


Yes 


66 (41) 


33 (40) 


33 


(42) 


— 


No 


95 (59) 


50 (60) 


45 


(58) 


— 


Average GPA 


Male 


3.48 


3.43 


3.52 


3.26 


Female 


3.40 


3.39 


3.45 


3.31 



a Population Statistics for the sophomore class. (Source: Northwestern Office of the Registrar) 
b Whether the respondent has declared their major at the time of the survey 
c Whether the respondent is an international student 

d Whether at least one of the respondent’s parents is foreign-born, and respondent is US-born 



Table 3: Distribution of WCAS Majors 



WCAS Majors® 


All 

Preq (%) 


Sample'’ 
Males 
Preq (%) 


Females 
Freq (%) 


Class of 2006® 
All Males 

Freq (%) Preq (%) 


Females 
Preq (%) 


Natural Sciences 


31 


(19) 


15 


(22) 


16 


(17) 


156 


(14) 


62 


(12.5) 


94 


(15.5) 


Math & Computer Sci. 


4 


(2.5) 


2 


(3) 


2 


(2) 


37 


(3.5) 


29 


(6) 


8 


(1) 


Social Sciences I 


41 


(25.5) 


12 


(17) 


29 


(31.5) 


512 


(46.5) 


211 


(42.5) 


301 


(49) 


Social Sciences II 


48 


(30) 


29 


(42) 


19 


(21) 


217 


(20) 


140 


(28.5) 


77 


(13) 


Ethics and Values 


4 


(2.5) 


4 


(6) 


0 


(0) 


25 


(2) 


14 


(3) 


11 


(2) 


Area Studies 


13 


(8) 


5 


(7) 


8 


(9) 


24 


(2) 


4 


(1) 


20 


(3) 


Literature & Fine Arts 


20 


(12.5) 


2 


(3) 


18 


(19.5) 


132 


(12) 


32 


(6.5) 


100 


(16.5) 


Total 


161 


(100) 


69 


(100) 


92 


(100) 


1103 


(100) 


492 


(100) 


611 


(100) 



a Majors that appear in each category are listed in Table la 

b In cases where the survey respondent has more than one major in WCAS, only the first one is included 
c Only includes students with a primary WCAS major (Source: Integrated Postsecondary Education Data System) 
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Table 4: Percent Chance of graduating with a GPA of at least 3.5 if majoring in: 

Engineering Lit. &: Fine Arts 




Table 5: Single Major Choice- Estimation of Homogeneous Preferences 



Using Stated Choice 



Using stated Preference 



Arti for graduating within 4 years 
Arti for graduating within 4 years x female 
Au 2 for graduating with a GPA of at least 3.5 
Art2 for graduating with a GPA of at least 3.5 x female 
Art3 for enjoying the coursework 
Arts for enjoying the coursework x female 
7s for hours/week spent on coursework 

for hours/week spent on coursework x female 
Art4 for approval of parents and family 
Art4 for approval of parents and family x female 
Arts for finding a job upon graduation 
Arts for finding a job upon graduation x female 
Arts for enjoying work at the available jobs 
Arts for enjoying work at the available jobs x female 
Arty for reconciling family and work at available jobs 
Arty for reconciling family and work x female 
72 for hours/week spent at work 

72 for hours/week spent at work x female 

73 for the social status of the available jobs“ 

7s for the social status of the available jobs x female 

74 for expected Income at the age of 30 

74 for expected Income at the age of 30 x female 
Log-Likelihood 

No. of Observations 

* significant at 10%; ** significant at 5%; *** significant at 1%; robust standard errors in parentheses 
a - social status is on a scale of 1-8 (8 being the highest social status); normalized to be between 0. 1-0.8 
all other variables (except income) are probabilities between 0 and 1 



0) 




(3) 


(4^ 


6.84*** 


1.65 


-0.447 


-1.54* 


(1.78) 


(2.93) 


(0.868) 


(0.80) 


54.27*** 




3.15** 




(6.63) 




(1.37) 


-3.83*** 


-1.95 


0.903* 


1.13* 


(1.11) 


(1.94) 


(0.520) 


(0.67) 


-8.44** 


0.048 




(4.03) 




(1.12) 


13.11*** 


9.93** 


2.69*** 


2.06*** 


(2.47) 


(4.36) 


(0.45) 


(0.70) 


11.36 


1.43 




(8.39) 




(0.946) 


-0.058*** 


-0.057** 


0.012 


0.0064 


(0.017) 


(0.028) 


(0.011) 


(0.0135) 


-0.045 




0.0189 




(0.071) 




(0.021) 




1.74 


1.37** 


0.98 


(1.16) 


(3.14) 


(0.56) 


(0.75) 


1.71 


1.03 




(3.98) 




(I- 


2.27* 


4.01* 


-0.076 


0.279 


(1.20) 


(2.17) 


(0.512) 


(0.829) 


0.74 




-0.863 




(4.15) 






6.65*** 


1.80 


1.59*** 


0.468 


(2.05) 


(3.21) 


(0.384) 


(0.526) 




18.86*** 




1.80** 




(7.01) 




(0.817) 


-1.93* 


-1.31 


0.241 


0.258 


(1.11) 


(2.77) 


(0.539) 


(0.671) 


-2.36 




0.181 




(4.66) 




(0.946) 


-0.0066 


0.0282 


-0.0080 


-0.015 


(0.0166) 


(0.038) 


(0.0099) 


(0.015) 


-0.073 


0.024 




(0.082) 




(0.018) 


3.27*** 


4.01* 


1.09*** 


(1.12) 


(2.28) 


(0.32) 


(0.53) 


-0.59 


-1.696** 




(4.08) 




(0.662) 


-5.25e- 7 


9.43e - 6 


6.43e - 7 


1.13e-6 


(4.25e- 6) 


(7.91e-6) 


(1.02e- 6) 


(2.43e - 6) 


-19.1e-6 


-4.40e - 7 




(21.8e-6) 




(2.53e - 6) 


-56.58 


-40.77 


-733.52 


-703.255 


83 


83 


83 


83 



40 




Table 6: Decomposition Analysis 





All 


Male 


Female 




(1) 


(2) 


(3) 


Panel A: Estimates Using Stated Choice Data 








Attributed to: 








Pecuniary Attributes 


24.30% 


49.00% 


33.90% 


Non-Pecuniary Attributes 


75.70% 


51.00% 


66.10% 


Attributed to: 








Parents’ Approval + Enjoying Coursework 


44.95% 


40.35% 


39.95% 


Coursework hrs/week + GPA + Graduating in 4 yrs 


22.20% 


7.10% 


22.50% 


Finding a job + Job hrs/week + Income at 30 + Status of Job 


22.05% 


47.00% 


13.35% 


Reconcile work & family + Enjoying Work 


10.80% 


5.55% 


24.20% 


Panel B: Estimates using Stated Preference 








Attributed to: 








Pecuniary Attributes® 


27.90% 


53.80% 


18.20% 


Non-Pecuniary Attributes^ 


72.10% 


46.20% 


81.80% 


Attributed to: 








Parents’ Approval + Enjoying Coursework 


43.50% 


34.00% 


44.00% 


Coursework hrs/week + GPA + Graduating in 4 yrs 


20.40% 


10.30% 


11.85% 


Finding a job + Job hrs/week + Income at 30 + Status of Job 


20.10% 


48.30% 


16.05% 


Reconcile work & family + Enjoying Work 


16.00% 


7.40% 


28.10% 



a Pecuniary attributes are the following outcomes pooled together: Graduating in 4 years; 

Graduating with a GPA of at least 3.5; hrs/week spent on coursework; Finding a job upon graduation; 
Job hrs/week; Income at 30; Status of the available jobs. 
b The non-pecuniary attributes include all outcomes not included in a 
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Table 7: Thought Experiments 





Natural 

Sciences 


Math & 
Comp Sc 


Social 
Sc. I 


Social 
Sc. 11 


Ethics 
& Values 


Area 

Studies 


Lit. fc 
Fine Arts 


Eng. 


Baseline Model 
Avg. Male Prob. for: 
Avg. Female Prob. for: 


0.189 

0.151 


0.090 

0.062 


0.171 

0.226 


0.189 

0.112 


0.094 

0.106 


0.082 

0.140 


0.068 

0.156 


0.117 

0.047 


Expt 1: 10% INCREASE 


% Change in the probability of majoring if: 

in probability of graduating with a CPA of at least 3.5 in Engineering 




Avg. Male Prob. for:“ 
Avg. Female Prob. for: 


-0.93% 

-0.49% 


-1.07% 

-0.49% 


-0.75% 

-0.18% 


-0.97% 

-0.44% 


-0.70% 

-0.24% 


-0.68% 

-0.21% 


-0.71% 

-0.16% 


6.46% 

5.86% 


Expt 2: 10% DECREASE 


in probability of graduating with 


a CPA of at least 3.5 i 


in Literature and Fine Arts 


Avg. Male Prob. for: 
Avg. Female Prob. for: 


0.54% 

0.76% 


0.50% 

0.88% 


0.66% 

1.35% 


0.50% 

1.07% 


0.77% 

1.53% 


0.70% 

1.61% 


-6.56% 

-7.89% 


0.49% 

0.79% 


Expt 3:10% INCREASE in probability of finding 


a job after graduating 


in Social Sciences I 






Avg. Male Prob. for: 
Avg. Female Prob. for: 


-0.27% 

0.56% 


-0.23% 

0.60% 


1.41% 

-2.35% 


-0.29% 

0.57% 


-0.32% 

0.81% 


-0.36% 

0.81% 


-0.33% 

0.79% 


-0.25% 

0.47% 


Expt 4: 10% INCREASE 


in probability of approval of parents for Social Sciences I 








Avg. Male Prob. for: 
Avg. Female Prob. for: 


-1.12% 

-2.52% 


-0.99% 

-2.72% 


6.00% 

10.50% 


-1.22% 

-2.49% 


-1.47% 

-3.69% 


-1.56% 

-3.53% 


-1.51% 

-3.60% 


-1.05% 

-2.06% 


Expt 5: 10% INCREASE 


in probability of enjoying coursework in Engineering 








Avg. Male Prob. for: 
Avg. Female Prob. for: 


-1.57% 

-1.53% 


-1.61% 

-1.83% 


-1.29% 

-0.57% 


-1.67% 

-1.72% 


-1.21% 

-0.70% 


-1.13% 

-0.72% 


-1.21% 

-0.49% 


11.04% 

19.23% 


Expt 6: 10% DECREASE 


in the social status of jobs available after graduating in 


Social Sciences II 




Avg. Male Prob. for: 
Avg. Female Prob. for: 


2.50% 

0.23% 


3.09% 

0.41% 


2.86% 

0.24% 


-11.50% 

-2.14% 


2.31% 

0.26% 


2.29% 

0.27% 


2.47% 

0.23% 


3.14% 

0.48% 



a each cell corresponds to the percent change in the probability of majoring in that category after 
the intervention relative to the baseline case. 
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Table 8: Stated reasons for choosing a major 



How imp. were the following reasons in choosing a major:“ 


Males 


Females 


My parents wanted me to 


6.02 


5.33 




(9.60) 


(11.13) 


A mentor/ role model encouraged me to 


7.31* 


4.27 




(12.00) 


(7.99) 


My siblings made the same choice 


1.80** 


0.29 




(5.45) 


(1.17) 


My high school friends and peers made the same choice 


1.43 


1.21 


(3.51) 


(5.27) 


The societal reputation of the choice 


7.75 


7.71 




(10.01) 


(11.74) 


To be able to get a high-paying job 




7.92 




(11.80) 


(10.43) 


To be able to get a job where I could balance work & family 


8.76* 


6.06 


(8.92) 


(7.64) 


To be able to get a job in a field where people of my gender 


Q** 


0.80 


are not discriminated against 




(2.34) 


To get a job that I would enjoy 


18.68* 


23.15 




(13.73) 


(15.40) 


To get training for a specific career 


7.24 


7.57 




(9.69) 


(9.19) 


To learn more about things that interest me 


18.96** 


25.44 


(16.23) 


(18.16) 


To be able to do well in the coursework of the major 


7.05 


8.45 


(7.72) 


(9.48) 


Fraction of ppl of my gender teaching classes in the major 


0 


0.18 

(0.89) 


Fraction of people of my gender taking classes in the major 


0 


0.076 

(0.62) 


Fraction of ppl of my gender in jobs related to the major 


0.29 


0.15 




(1.19) 


(0.86) 


Other Reasons 


0.69 


1.36 




(4.90) 


(6.53) 



* gender diff is significant at 10%; ** sig at 5%; *** sig at 1% (2-tailed T-test) 
Standard deviation in parentheses 

Each cell is the AVERAGE contribution of the reasons for the choice of majors 



“The exact question was: "In deciding your major, how important to you was each of the 
following reasons? For this question you need to assign an integer between 0 and 100 
to each of the following reasons. Moreover, the responses SHOULD ALL SUM TO 100." 
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Table 10: Estimation of heterogeneous preferences using Stated Preference 



Entire Sample 



Males 



Females 



Aui for graduating within 4 years 

Au 2 for graduating with a GPA of at least 3.5 

Arts for enjoying the coursework 

for hours/week spent on coursework“ 

Au 4 for parents approv x parents supp‘d x (1-Foreign®) 

Att4 for parents approval x parents’_support x Foreign 

Arts for Ending a job upon graduation 

Atte for enjoying work at the available jobs 

Awy for reconciling family and work at available jobs 

Awy for reconciling family & work x divorced-^ 

72 for hours/week spent at work‘d 

73 for social status of the available jobs^ x (1-Foreign) 
^ for social status of jobs x Foreign 

7|^^ for exp. Inc at 30 x (1- low_inc®) x (1-Foreign) 
74^^ for exp Inc at 30 x (l-low_income) x Foreign 
74^ for exp. Income at 30 x low_inc x (1-Foreign) 

74^ for expected Income at 30 x low_inc x Foreign 

Log-Likelihood 
No. of groups 





(2) 


(3) 


-0.545 


-0.958 


1.20 


(0.791) 


(0.911) 


(1.21) 


0.752 


0.751 


1.01 


(0.575) 


(0.721) 


(1.01) 


2.92*** 


2 49 *** 


3.57*** 


(0.466) 


(0.754) 


(0.658) 


0.0152 


0.0098 


0.0232 


(0.011) 


(0.014) 


(0.016) 


0.340** 


0.578*** 


0.262 


(0.150) 


(0.217) 


(0.194) 


0.0439 


-0.147 


0.601** 


(0.159) 


(0.205) 


(0.246) 


0.205 


0.680 


-0.536 


(0.494) 


(0.759) 


(0.637) 


1.51*** 


0.319 


2 24*** 


(0.414) 


(0.611) 


(0.678) 


0.246 


0.700 


0.547 


(0.579) 


(0.747) 


(0.847) 


-0.357 


0.494 


-0.613 


(0.864) 


(1.26) 


(1.32) 


-0.0097 


-0.0044 


0.0045 


(0.0100) 


(0.016) 


(0.012) 


0.310 


1.30* 


0.297 


(0.432) 


(0.76) 


(0.546) 


2.28*** 


3.27*** 


0.817 


(0.550) 


(0.93) 


(0.580) 


!.66e — 06 


3.08e - 06 


17.5e-0( 


!.75e - 06) 


(2.80e - 06) 


(12.5e-0( 


8.16e- 07 


-ll.le- 06 


7.13e-0' 


!.33e - 06) 


(8.07e - 06) 


(7.28e - 0( 


..06e-07 


-3.89e - 06 


1.02e-0( 


!.39e - 06) 


(3.54e - 06) 


(2.58e - 0( 


).64e — 06 


11.3e-06** 


1.40e-0( 


L55e - 06) 


(5.42e - 06) 


(5.78e - 0( 


-726.19 


-287.61 


-401.68 


83 


33 


50 



f Estimates correspond to the estimation of a logit model on stated preference data 
* significant at 10%; ** significant at 5%; *** significant at 1%; robust standard errors in parentheses 
a {b) - number of hours spent per week on coursework (job) varies between 0 and 100; 
c - social status is on a scale of 1-8 (8 being the highest social status); normalized to be between 0. 1-0.8 
all other variables (except income) are probabilities between 0 and 1 

d - parents’ support = 1 if no education expenses are paid by parents; = 2 if they pay less than $5,000; 

= 3 if they pay between $5,000- $10,000; = 4 if they pay between $10,000- $15,000; = 5 if they pay between 
$15,000-$25,000; = 6 if they pay $25,000-b 

e - Foreign is a dummy that equals 1 if either of the respondent’s parents is foreign-born. 

/ - divorced = 1 if respondent’s parents are divorced or separated; zero otherwise 
g - low_income = 1 if parents’ annual income is less than $150,000; zero otherwise 
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Table 11: Decomposition Analysis 





Foreign-Born 


No Foreign-Born 




Parents 


Parents 




Males 


Females 


Males 


Females 




(1) 


(2) 


(3) 


(4) 


Attributed to: 
Pecuniary Attributes 


71.40% 


35.40% 


27.60% 


12.20% 


Non-Pecuniary Attributes 


28.60% 


64.60% 


72.40% 


87.80% 



Attributed to: 

Parents’ Approval -I- Enjoying Coursework 


25.25% 


46.90% 


56.55% 


51.80% 


Coursework hrs/week + GPA + Graduating in 4 yrs 


2.85% 


15.30% 


5.20% 


8.20% 


Finding a job -b Job hrs/week -b Income at 30 -b Status of Job 


65.90% 


26.70% 


28.95% 


11.80% 


Reconcile work & family -b Enjoying Work 


6.00% 


21.10% 


9.30% 


28.20% 



a Pecuniary attributes are the following outcomes pooled together: Graduating in 4 years; Graduating with a GPA of at least 3.5; 
hrs/week spent on coursework; Finding a job upon graduation; Job hrs/week; Income at 30; Status of the available jobs. 
b The non-pecuniary attributes include all outcomes not included in a 



Table 12: Best Linear Predictor of Expectation of Parent’s Approval 
Dependent Variable:Expectation of Parent’s Approval^ 

Entire Sample Males Females 

Estimates Std. Error Estimates Std. Error Estimates Std. Error 





(1) 


(2) 


(3) 


(4) 


(5) 


(3) 


Expectation of:“ 

Social Status of jobs x (1- Parents_foreign^) 


0.084** 


(0.035) 


0.0611 


(0.0622) 


0.090** 


(0.043) 


the status of the jobs x Parents foreign 


0.188*** 


(0.047) 


0.125* 


(0.091) 


0.228*** 


(0.064) 


graduating with a GPA of at least 3.5 


-0.0466 


(0.0467) 


-0.003 


(0.078) 


-0.073 


(0.056) 


graduating in 4 years 


0.0798 


(0.067) 


0.068 


(0.096) 


0.069 


(0.092) 


enjoying coursework 


0.0013 


(0.0013) 


0.00046 


(0.0019) 


0.0016 


(0.0018) 


enjoying work at the jobs 


0.114*** 


(0.041) 


0.063 


(0.0660) 


0.145*** 


(0.053) 


finding a job upon graduation 


0.289*** 


(0.067) 


0.279** 


(0.122) 


0.303*** 


(0.071) 


finding a job x Parents foreign 


0.207** 


(0.082) 


0.219* 


(0.124) 


0.202* 


(0.110) 


Income at 30 (in 10,000s) 


0.000023 


(0.00112) 


0.0023 


(0.0035) 


-0.0006 


(0.0009) 


Income at 30 (in 10,000s) x Low Income”’ 


0.0018 


(0.0022) 


-0.00082 


(0.0048) 


0.0028* 


(0.0015) 


Mother studied given major”* 


0.024 


(0.018) 


0.051 


(0.031) 


0.0055 


(0.02) 


Father studied given major® 


0.032** 


(0.015) 


0.0364* 


(0.022) 


0.024 


(0.022) 


Studying Given Major-^ 


0.0357*** 


(0.013) 


0.021 


(0.021) 


0.048*** 


(0.016) 


Respondent Fixed-Effects 




Yes 




Yes 




Yes 


Major-Specific Dummies 




Yes 




Yes 




Yes 


No. of Observations 




1287 




551 




736 


No. of Glusters 




161 




69 




92 



f Dependent variable is a response 0-1 to: "If you were majoring in [X], what do you think is the percent 
chance that your parents and other family members would approve of it?" 

All regressions include major-specific dummies, and respondent fixed effects. (Constants not shown) 

Parameter estimates correspond to the estimation of OLS model. Cluster errors in parentheses 
* significant at 10%; ** significant at 5%; *** significant at 1% 

a Expectations of outcomes except income are between 0 and 1; status is discrete on a scale of 0-0.9 
b a dummy that equals one if either of the respondent’s parents is foreign-born 
c a dummy that equal one if respondent’s parents’ annual earnings are less than $150,000 
d a dummy that equals one if mother’s field of study is the same as the relevant question 
e a dummy that equals one if father’s field of study is the same as the relevant question 

/ a dummy that equals one if the respondent’s intended major category is same as category X in the question 
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Table 13: Mixed Logit Model Estimation 

Without Demographics With Demographics 
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Log-Likelihood —698.00 —690.84 

No. of Groups 83 83 

No. of Observations 2957 2957 

a Mean and Standard Deviation of the Log-Normally distributed coefficients are calculated at the estimated Au and cr. 
* significant at 10%; ** significant at 5%; *** significant at 1%; standard errors in parentheses 




Table 14: Decomposition Analysis to explain gender differences 
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All included variables 0.0251 - 0.0523 0.0711 - 0.0082 0.0350 - 0.0320 0.0196 - 0.0328 

41.75% 62. .01% 78.36% 20.00% 58.25% 37.99% 21.64% 80.00% 

Standard errors (computed by delta method) in parentheses. *** significant at 1 %; ** significant at 5 %; * significant at 10 %; 
a. The contribution in % of the relevant variable to the gap 




Table 15: Simulations of the Gender Gap under different Environments 



Fields of Study 


Base° 


Ability 


Income 


Enjoying 

Goursework 


Enjoying 

Work 




(1) 


(2) 


(3) 


(4) 


(5) 


Engineering 


0.0602“ 


0.0517 

13.92%'’ 


0.0608 

-1.06% 


0.0308 

48.74% 


0.0534 

11.18% 


Natural Sciences 


0.0550 


0.0445 

18.98% 


0.0529 

3.88% 


0.0229 

58.29% 


0.0406 

26.48% 


Math & Gomputer Sci. 


0.0191 


0.0135 

29.07% 


0.0184 

3.45% 


0.0074 

61.41% 


0.0083 

56.38% 


Social Sciences I 


-0.0412 


-0.0524 

-27.28% 


-0.0474 

-15.32% 


-0.0643 

-56.25% 


-0.0613 

-48.84% 


Social Sciences II 


0.0907 


0.0737 

18.68% 


0.0881 

2.88% 


0.0272 

69.92% 


0.0608 

32.92% 


Ethics & Values 


-0.0189 


-0.0266 

-40.77% 


-0.0219 

-15.87% 


-0.0419 

-122.03% 


-0.0381 

-101.9% 


Area Studies 


-0.0624 


-0.0634 

-1.69% 


-0.0655 

-4.96% 


-0.0563 

9.87% 


-0.0721 

-15.48% 


Lit. & Fine Arts 


-0.0843 


-0.0863 

-2.35% 


-0.0888 

-5.35% 


-0.0545 

35.34% 


-0.0777 

7.84% 



a The model predicted gender gap (male prob. - female prob.) under the relevant environment 
b The % decrease in the gender gap (relative to the baseline case) after the change 
c The predicted gap under the baseline case, i.e. no intervention 



Table 16: Perceptions of Monetary and Non-Monetary Discrimination 



Category 


% Females in class“ 


Males Poorly'' 


Ferns Poorly 


Males“ 


Ferns’’' 


Recs“ 


Males 


Ferns 


Males 


Ferns 




(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


Natural Sciences 


40.50 


39.22 


57.32 


8.49 


7.27 


24.03 


22.90 


Math & Gomp Sci 


31.50** 


25.22 


34.12 


7.17 


6.53 


23.71 


29.19 


Social Sciences I 


56.56* 


60.30 


61.72 


8.55 


11.56 


12.71 


14.75 


Social Sciences II 


43.13 


42.83 


34.97 


8.45 


7.27 


19.90 


27.72 


Ethics and Values 


55.39 


55.98 


39.18 


9.56 


11.01 


12.07 


15.71 


Area Studies 


59.84 


58.15 


77.27 


10.52 


9.87 


11.07 


13.19 


Lit & Fine Arts 


64.82 


66.16 


73.11 


11.19 


11.22 


8.94 


13.15 


Music Studies 


59.25 


57.22 


50.97 


13.05 


10.63 


9.25 


13.16 


Educ & Social Policy 


66.21 


68.86 


76.24 


13.47 


16.57 


9.82 


13.18 


Gommunication Std 


58.71 


59.72 


57.88 


10.82 


11.90 


11.98 


15.43 


Engineering 


30.01 


27.28 


27.10 


6.09 


5.34 


25.61 


30.77 


Journalism 


58.90 


60.22 


71.42 


10.47 


11.69 


12.03 


16.30 



*** gender diff sig (p-value < 0.01; two-tailed t-test); ** diff sig at 5% level; * diff sig at 10% level 
“ Fraction of students in the major who are females (on a scale of 0-100) 

^ The average belief that males would be treated poorly in the jobs that would be 
available in each of the specified categories 

Response of male (female) survey respondents to the relevant question 
® Fraction of females amongst graduates with that major in 2005 & 2006 (source: IPEDS) 



49 




Table 17: Correlation patterns between various beliefs 





Females 


Males 


% 


Enjoy 




treated 


treated 


Females 


Course- 




poorly 


poorly 


in class 


work 






Females 




Females treated poorly 


1.00 








Males treated poorly 


0.3997*** 


1.00 






% of Females in the class 


-0.3523*** 


0.2191*** 


1.00 




Enjoy coursework 


-0.2616*** 


0.0926*** 


0.4654*** 


1.00 


Enjoy working at jobs 


-0.1583*** 


-0.0205 


0.2699*** 


0.6196*** 






Males 




Females treated poorly 


1.00 








Males treated poorly 


0.5575*** 


1.00 






% of Females in the class 


-0.3584*** 


0.0692 


1.00 




Enjoy coursework 


-0.1604*** 


-0.0483 


0.0832* 


1.00 


Enjoy working at jobs 


-0.0741 


-0.0915* 


0.0293 


0.6704*** 



correlation significant nsing the Spearman’s rank correlation coefficient: 
*** sig at 1%, ** sig. at 5%, * sig. at 10% 
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